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PERSEPHIN AND RELATED GROWTH FACTORS 

Reference -fco Government: Grant 

This invention was made with government support 
under Grant Numbers NS24679 and CA53524. The government 
5 has certain rights in this invention. 



Related Applications 

This application is a continuation-in-part of 
Application Serial No. 08/881,172 filed June 23, 1997 
10 which is a continuation-in-part of Application Serial No. 
08/615,944 filed March 14, 1996 and a continuation- in 
part of Application PCT/US97/03461 filed March 14, 1997. 

Background of the Invention 
15 (1) Field of the Invention 

This invention relates generally to trophic or 
growth factors and, more particularly, to novel growth 
factors of the neurturin-GDNF family of growth factors. 
(2) Description of the Related Art 
20 The development and maintenance of tissues in 

complex organisms requires precise control over the 
processes of cell proliferation, differentiation, 
survival and function. A major mechanism whereby these 
processes are controlled is through the actions of 
25 polypeptides known as "growth factors". These 

structurally diverse molecules act through specific cell 
surface receptors to produce these actions. 

Growth factors, termed "neurotrophic factors" 
promote the differentiation, growth and survival of 
30 neurons and reside in the nervous system or in innervated 
tissues. Nerve growth factor (NGF) was the first 
neurotrophic factor to be identified and characterized 
( Levi-Montalcini et al., J- Exp. Zool, 116:321, 1951 
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which is incorporated by reference)- NGF exists as a 
non-covalently bound homodimer that promotes the survival 
arid growth of sympathetic, neural crest -derived sensory, 
and basal forebrain cholinergic neurons. In sympathetic 
5 neurons this substance produces neurite outgrowth ±n 

vitro and increased axonal and dendritic growth in vivo. 
(See Levi -Mont alcini and Booker, Proc Nat ' 1 Acad Sci 
46:384-391, 1960; Johnson et al . Science 210: 916-918, 
1980; Crowley et al-. Cell 76:1001-12, 1994 which are 
10 incorporated by reference). NGF has effects on cognition 
and neuronal plasticity, and can promote the survival of 
neurons that have suffered damage due to a variety of 
mechanical, chemical, viral, and immunological insults 
(Snider and Johnson, Ann Neurol 26:489-506, 1989; Hefti, 
15 J Nenroblol 25:1418-35, 1994 which are incorporated by 
reference). NGF also is known to extensively interact 
with the endocrine system and in immune and inflammatory 
processes. (Reviewed in Scully and Otten, Cell Biol Int 
29:459-469, 1995; Otten and Gadient, Int. J. Devi 
20 Neuroscl 13:147-151, 1995 which are incorporated by 

reference). For example, NGF promotes the survival of 
mast cells. ( Horigome et al . J Biol Chem 269:2695-2707, 
1994 which is incorporated by reference). 

In recent years it has become apparent that growth 
25 factors fall into classes, i.e. families or superf amilies 
based upon the similarities in their amino acid 
sequences. These families include, for example, the 
fibroblast growth factor family, the neurotrophin family 
and the transforming growth factor-beta (TGF-B) family. 
30 As an example of family member sequence similarities, 

TGF-B family members have 7 canonical framework cysteine 
residues which identify members of this super family. 

NGF is the prototype of such a family of growth 
factors. Brain-derived neurotrophic factor (BDNF), the 
35 second member of this family to be discovered, was shown 
to be related to NGF by virtue of the conservation of all 
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six cysteines that form the three internal disulfides of 
the NGF monomer (Barde, Prog Growth Factor Res 2:237-248, 
1990 and Liebrock et al. Nature 341:149-152, 1989 which 
are incorporated by reference). By utilizing the 
5 information provided by BDNF of the highly conserved 
portions of two factors, additional members (NT-3, NT- 
4/5) of this neurotrophin family were rapidly found by 
several groups (Klein, FASEB J 8:738-44, 1994 which is 
incorporated by reference ) . 
10 Neurotrophic factors structurally unrelated to NGF 

have been recently identified. These include factors 
originally isolated based upon a "neurotrophic action" 
such as ciliary neurotrophic factor (CNTF) (Lin et al • , 
Science 246:1023-5 , 1989 which is incorporated by 
15 reference) along with others originally isolated as a 

result of non-neuronal activities (e.g. fibroblast growth 
factors (Cheng and Matt son Neuron 1:1031-41,1991 which is 
incorporated by reference), IGF-I (Kanje et al. Brain Res 
486:396-398, 1989 which is incorporated by reference) 
20 leukemia inhibitory factor (Kotzbauer et al. Neuron 
12:763-773, 1994 which is incorporated by reference). 

Glial-derived neurotrophic factor (GDNF), is one 
such neurotrophic factor structurally unrelated to NGF. 
GDNF was, thus, a unique factor, which, up until now, was 
25 not known to be a member of any subfamily of factors. 

The discovery, purification and cloning of GDNF resulted 
from a search for factors crucial to the survival of 
midbrain dopaminergic neurons, which degenerate in 
Parkinson's disease. GDNF was purified from rat B49 
30 glial cell conditioned media (Lin et al.. Science 

260:1130-2, 1993 which is incorporated by reference). 
Sequence analysis revealed it to be a distant member of 
the TGF-6 super family of growth factors, having 
approximately 20% identity based primarily on the 
35 characteristic alignment of the 7 canonical framework 
cysteine residues (Lin et al.. Science 260:1130-2, 1993 
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which is incorporated by reference). Thus, GDNF could 
possibly have represented a new subfamily within the TGF- 
6 super family* 

Recombinant GDNF produced in bacteria specifically 
5 promotes the survival and morphological differentiation 
of dopaminergic neurons (Lin et al • , Science 260:1130-2, 
1993); Tomac et al • , Nature 373:335-9, 1995; Beck et al.. 
Nature 373:339-41, 1995 and Ebendal et al., J Neuroscl 
Res 40:276-84, 1995 which are incorporated by reference) 
10 and motor neurons (Henderson et al.. Science 266:1062-4, 
1994; Yan et al,. Nature 373:341-4, 1995; and Oppenheim 
et al.. Nature 373:344-6, 1995 which are incorporated by 
reference). Overall, GDNF was a more potent factor for 
promoting the survival of motor neurons than the other 
15 factors, and it was the only factor that prevented 

neuronal atrophy in response to these lesions, thereby 
positioning it as a promising therapeutic agent for motor 
neuron diseases. 

It is now generally believed that neurotrophic 
20 factors regulate many aspects of neuronal function, 

including survival and development in fetal life, and 
structural integrity and plasticity in adulthood. Since 
both acute nervous system injuries as well as chronic 
neurodegenerative diseases are characterized by 
25 structural damage and, possibly, by disease- induced 

apoptosis, it is likely that neurotrophic factors play 
some role in these afflictions. Indeed, a considerable 
body of evidence suggests that neurotrophic factors may 
be valuable therapeutic agents for treatment of these 
30 neurodegenerative conditions, which are perhaps the most 
socially and economically destructive diseases now 
afflicting our society. Nevertheless, because different 
neurotrophic factors can potentially act preferentially 
through different receptors and on different neuronal or 
35 non-neuronal cell types, there remains a continuing need 
for the identification of new members of neurotrophic 
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factor families for use in the diagnosis and treatmeni: of 
a variety of acute and chronic diseases of the nervous 
system . 

5 Summarv of the Invention : 

Briefly, therefore, the present invention is 
directed to the identification and isolation of 
substantially purified factors that promote the survival 
and growth of neurons as well as non-neuronal cells. 
10 Accordingly, the inventors herein have succeeded in 

discovering novel protein growth factors belonging to a 
family of growth factors for which GDNF was the first 
known member. The first such newly discovered family 
member was neurturin and this is the subject of copending 
15 application serial Number 08/519,777. Based upon the 

sequence of GDNF and neurturin the inventors herein have 
discovered another member of the GDNF-Neurturin family of 
growth factors referenced herein as persephin ( PSP ) . 
This growth factor is believed to show at least 75% 
20 sequence identity among homologous sequences from 

different mammalian species although sequence homology 
may be as low as 65% in non-mammalian species such as 
avian species. Indeed, the mouse, rat and human mature 
persephin sequences show from about 80% to about 94% 
25 sequence identity. Mature persephin proteins identified 
herein comprise mouse sequences as set forth in SEQ ID 
NOS:79 and 187 (Figure 17B amino acids 66 through 154 and 
61 through 156, respectively), rat sequences as set forth 
in SEQ ID NOS:82 and 196 (Figure 18B amino acids 6 
30 through 94 and 1 through 96, respectively), and human 

sequences as set forth in SEQ ID N0S:221 and 223 (Figure 
24; amino acids 61-156 and 66-154, respectively). 

Persephin has been identified and obtained by a 
method based upon the conserved regions of the GDNF- 
35 Neurturin family discovered by the inventors herein. 

Accordingly, a new method has been devised that utilizes 
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degenera-te primers constructed from the sequences of 
these conserved regions for use in the polymerase chain 
reaction procedure. By utilizing this method the mouse, 
rat and human orthologs of the new family member, 
5 persephin, have been identified and obtained. 

The present invention thus provides both amino 
acid sequences and nucleotide sequences that encode 
mouse, rat and human persephin including amino acid 
sequences of SEQ ID NOS:79, 82, 187, 196, 221 and 223 and 
10 nucleotide sequences of SEQ ID NOS: 183, 193 and 199 and 
201 as well as the complements of such nucleotide 
sequences (SEQ ID NOS: 184, 194, 200 and 202 ) • In 
addition, the present invention includes pre-, pro- and 
prepro- regions as well as pre-pro persephin amino acid 
15 and nucleotide sequences. 

Expression vectors and stably transformed cells 
comprising persephin polynucleotides are also within the 
scope of this invention. The transformed cells can be 
used in a method for producing persephin. 
20 In another embodiment, the present invention 

provides a method for preventing or treating cellular 
degeneration comprising administering to a patient in 
need thereof a therapeutically effective amount of 
persephin. A patient may also be treated by implanting 
25 transformed cells which express persephin or a DNA 

sequence which encodes persephin into a patient, or cells 
cultured and expanded by growth in persephin. 

The present invention also provides compositions 
and methods for detecting persephin. One method is based 
30 upon persephin antibodies and other methods are based 
upon detecting mRNA or cDNA or genomic DNA encoding 
persephin using recombinant DNA techniques. 

In still further embodiments, the present 
invention includes pan-growth factors comprising a 
35 segment of a persephin sequence and a segment of at least 
one growth factor other than persephin. Also included 
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are polynucleotides encoding the pan-growth factors, 
vectors containing such polynucleotides and host cells 
comprising the polynucleotides. 

Among the several advantages found to be achieved 
5 by the present invention, therefore, may be noted the 
provision of a new growth factor, persephin, for use in 
preventing the atrophy, degeneration or death of certain 
cells, in particular neurons; the provision of human 
persephin; the provision of other members of the 

10 neurturin-persephin-GDNF family of growth factors by 

making available new methods capable of obtaining other 
family members; the provision of methods for obtaining 
persephin by recombinant techniques; the provision of 
methods for preventing or treating diseases producing 

15 cellular degeneration and, particularly neuronal 

degeneration; the provision of methods that can detect 
and monitor persephin levels in a patient; and the 
provision of methods that can detect alterations in the 
persephin gene . 

20 

Brief Description of the Drawings 

Figure 1 illustrates the purification scheme for 
preparing neurturin from CHO cells; 

Figure 2 illustrates the characterization of 

25 fractions eluted from Mono S column in purifying 

neurturin showing (a) electrophoresis of each fraction on 
a SDS-polyacrylamide gel and visualization of the 
proteins by silver stain and (b) the neurotrophic 
activity present in each fraction in the superior 

30 cervical ganglion survival assay; 

Figure 3 illustrates the ability of neurturin to 
maintain survival of superior cervical ganglionic cells 
in culture showing (a) positive control cells maintained 
with nerve growth factor (NGF) (b) negative control cells 

35 treated with anti-NGF antibodies showing diminished 
survival and (c) cells treated with anti-NGF and 
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neur-turin (approximately 3 ng/ml ) showing survival of 
neurons ; 

Figure 4 illustrates the concentration-response 
effect of neurturin in the superior cervical ganglion 
5 survival assay; 

Figure 5 illustrates the homology of the amino 
acid sequences for the mature growth factors, human 
neurturin (hNTN), mouse neurturin (mNTN), rat GDNF 
(rGDNF), mouse GDNF (mGDNF) and human GDNF (hGDNF) with 
10 identical amino acid residues enclosed in boxes; 

Figure 6 illustrates the tissue distribution of 
neurturin mRNA and the mRNA for GDNF using RT/PCR 
analysis on RNA samples obtained from embryonic day 21 
(E21) and adult rats; 
15 Figure 7 illustrates the cDNA and encoded amino 

acid sequence of human pre-pro neurturin (SEQ ID NO: 11) 
showing the pre- region from nucleic acid 1 through 57 
(SEQ ID NO: 17), the pro- region from nucleic acid 58 
through 285 (SEQ ID NO: 20), human neurturin from nucleic 
20 acid 286 through 591 (SEQ ID NO: 9) and the splice site 
between nucleic acids 169 and 170 which defines the 
coding sequence portion of two exons from nucleic acids 1 
through 169 (SEQ ID NO: 27) and 170 through 594 (SEQ ID 
NO: 28); 

25 Figure 8 illustrates the cDNA and encoded amino 

acid sequence of mouse pre-pro neurturin (SEQ ID NO: 12) 
showing the pre- region from nucleic acid 1 through 57 
(SEQ ID NO: 18), the pro- region from nucleic acid 58 
through 285 (SEQ ID NO: 21), mouse neurturin from nucleic 

30 acid 286 through 585 (SEQ ID NO: 10) and the splice site 
t>etween nucleic acids 169 and 170 which defines the 
coding sequence portion of two exons from nucleic acids 1 
through 169 (SEQ ID NO: 29) and 170 through 588 (SEQ ID 
NO: 30); 

35 Figure 9 illustrates the mouse cDNA sequence 

containing a 5* non-coding region (SEQ ID NO: 13) and a 3' 
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non-coding region (SEQ ID NO: 14) each of which are 
contiguous to the coding region of pre-pro neurturin; 

Figure 10 illustrates the percent neuronal 
survival in E18 rat nodose ganglia neurons treated 24 
5 hours post-plating for NTN, GDNF, BDNF, NGF and AMO; 

Figure 11 illustrates the nucleotide and amino 
acid sequence of murine persephin (SEQ ID NOS:79, 80 and 
81; amino acid residues 52 through 140, 47 through 142, 
and 9 through 142, respectively); 
10 Figure 12 illustrates the family member sequence 

identity in the region between the first and seventh 
canonical framework cysteine residues aligned beginning 
with the first canonical framework cysteine for murine 
GDNF ( SEQ ID NO: 87 ) , murine neurturin ( NTN ) ( SEQ ID 
15 NO: 88) and murine persephin ( PSP ) (SEQ ID NO: 89); 

Figure 13 illustrates the partial sequence of rat 
persephin cDNA (SEQ ID NO: 97) obtained by the technique 
of rapid amplification of cDNA ends; 

Figure 14 illustrates the partial sequence 
20 beginning with the first canonical framework cysteine for 
rat persephin (SEQ ID NO: 83) and the corresponding 
polynucleotide sequence (SEQ ID NO: 86); 

Figure 15 shows (A) the family member aligned 
partial amino acid sequences from the first through the 
25 seventh canonical framework cysteine residues 

illustrating family member sequence homology of the 
mature growth factors, human GDNF (SEQ ID NO: 240), rat 
GDNF (SEQ ID NO: 241), mouse GDNF (SEQ ID NO: 242), human 
neurturin (NTN (human); SEQ ID N0:31), mouse neurturin 
30 (NTN (mouse); SEQ ID NO: 32), rat persephin (PSP (rat); 
SEQ ID NO:79), and mouse persephin (PSP( mouse); SEQ ID 
NO: 82) in which boxes enclose the 28 conserved amino acid 
residues present in all and (B) the aligned sequences of 
mature mouse persephin (mPSP; SEQ ID NO: 187), mature rat 
35 persephin (mPSP; SEQ ID NO: 198) and mature human 
persephin (hPSP; SEQ ID NO: 221); 



9914235A1_I > 



wo 99/14235 



PCT/US98/19163 



10 

Figure 16 illustrates the sequences of TGF-B 
superfamily members aligned using the Clustal method, 
from the first canonical framework cysteine to the end of 
the sequence for transforming growth factor-61 ( TGFBl ) , 
5 transforming growth factor-B2 ( TGFS2 ) , transforming 

growth factor-B3 ( TGFB3 ) , inhibin 6 A (INH6A), inhibin B 
B ( INH6B ) , the nodal gene ( NODAL ) , bone morphogenetic 
proteins 2 and 4 (BMP2 and BMP4 ) , the Drosophlla 
decapentapleglc gene (dpp), bone morphogenetic proteins 
10 5-8 (BMP5, BMP6, BMP7 and BMP8 ) , the Drosophlla 60A gene 
family {60A), bone morphogenetic protein 3 (BMP3), the 
Vgl gene, growth differentiation factors 1 and 3 (GDFl 
and GDF3), dorsalin ( drsln ) , inhibin a (INHa), the MIS 
gene (MIS), growth factor 9 (GDF-9), glial-derived 
15 neurotropic growth factor (GDNF) and neurturin (NTN); 

Figure 17 illustrates (A) full length murine 
persephin gene (SEQ ID NO: 177) with arrows indicating an 
88 nt intron from positions 155-242 and (B) the 
nucleotide sequence of murine pre-pro persephin (SEQ ID 
20 NO: 179) with encoded amino acid sequence (SEQ ID NO: 185); 
Figure 18 illustrates (A) full length rat 
persephin gene (SEQ ID NO: 188) with arrows indicating an 
88 nt intron from positions 155-242 and (B) the 
nucleotide sequence of rat pre-pro persephin (SEQ ID 
25 NO: 190) with encoded amino acid sequence (SEQ ID NO: 196); 

Figure 19 illustrates a western blot analysis 
using ant i -persephin antibodies to detect persephin 
protein in cell lysates from COS monkey cells transfected 
with the murine persephin gene ( lane 2 ) or the rat 
30 persephin gene ( lane 3 ) compared to cells transfected 

with the non- recombinant vector alone (pCB6, lane 4) and 
the mature protein produced by E. Coll (lane 1); 

Figure 20 illustrates the murine chimeric 
molecules (A) PSP/NTN containing the persephin fragment 
35 (residues 1-63) and the neurturin fragment (residues 68- 
100) and (B) NTN/PSP containing the neurturin fragment 
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(residues 1-67) and the persephin fragment (residues 64- 
96 ) with the arrow indicating the crossover point in 
each; 

Figure 21 illustrates the survival promoting 
5 effect of persephin in murine embryonic day~14 

mesencephalic cells cultured for three days (a) in the 
absence of persephin where almost all of the cells are 
dead and (b) in the presence of persephin (100 ng/ml ) 
where substantial neuronal cell survival is evident; 

10 Figure 22 illustrates the survival promoting 

effect of persephin (PSP) in murine embryonic day-14 
mesencephalic cells compared to effects of neurturin 
(NTN) and GDNF, measured by the number of cells stained 
with tyrosine hydroxylase (TOH); 

15 Figure 23 illustrates RT/PCT survey for persephin 

expression in adult mouse tissues showing persephin 
expression by Kidney cells; and 

Figure 24 illustrates the cDNA sequence of human 
pre-pro persephin ( SEQ ID NO: 203) with two silent 

20 mutations indicated at positions 30 and 360 and the 
encoded amino acid sequence (SEQ ID NO: 217) with the 
first amino acid of the pro- region indicated by the 
double asterisks ( ** ) at amino acid position 24 and the 
first amino acid of mature human persephin indicated by 

25 the single asterisk (*) at amino acid position 61. 

Description of the Preferred Embodiments 

The present invention is based upon the 
identification, isolation and sequencing of a DNA 

30 molecule that encodes a new growth factor, persephin. 

Persephin promotes cell survival and, in particular, the 
survival of neuronal cells. Prior to this invention, 
persephin was unknown and had not been identified as a 
discrete biological substance nor had it been isolated in 

35 pure form. 
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The growth factor, neurturin (NTN) was identified 
and isolated as set forth in copending application Serial 
Number 08/519,777 filed August 28, 1995, which is 
incorporated in its entirety by reference- From the 
5 sequence of neurturin and the sequence of the closely 

related growth factor, glial-derived neurotrophic factor 
(GDNF), the inventors herein have devised and pursued 
strategies to find additional related factors. Neurturin 
is approximately 40% identical to GDNF, but less than 20% 
10 identical to any other member of the TGF-B super family. 

Together these two proteins define a new subfamily within 
the TGF-6 super family. Several sequence regions within 
neurturin and GDNF were identified that are highly 
conserved, such that they are likely to be present in any 
15 additional members of this subfamily. This sequence 

information can, therefore, be used to isolate previously 
unknown members of this subfamily by designing degenerate 
oligonucleotides to be used as either primers in PGR 
reactions or as probes in hybridization studies. 
20 Using the new degenerate primer PGR strategy 

described in Example 11 of copending application Serial 
Number 08/519,777, the inventors herein have succeeded in 
identifying a third factor, persephin, that is 
approximately 40-50% identical to both GDNF and 
25 neurturin. Primers corresponding to the amino acid 
sequence from conserved regions of neurturin and GDNF 
(SEQ ID NO: 42 and SEQ ID NO: 44) were used to amplify a 77 
nt fragment from rat genomic DNA. The resulting products 
were subcloned into the Bluescript KS plasmid and 
30 sequenced. The sequence of one of the amplified products 
predicted amino acid sequence data internal to the PGR 
primers that was different from that of GDNF or neurturin 
but had more than 20% identity with GDNF and neurturin, 
whereas the sequences of other amplified products we 
35 obtained corresponded to GDNF or neurturin, as would be 
expected. The 22 nucleotide sequence (SEQ ID NO: 90) was 
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then aligned with the rat sequences of GDNF and neurturin 
and found to be unique. This novel sequence, thus, 
suggested that we had identified a new family member 
referenced herein as persephin. 
5 To obtain additional persephin sequence 

information, primers containing the unique 22 nucleotide 
sequence of the amplified fragment were used in the rapid 
amplification of cDNA ends (RACE) technique (Frohman, 
M.A. Methods in Enzymology 218:340-356, 1993) using cDNA 
10 obtained from neonatal rat brain. An approximately 350 
nt fragment was obtained from this PGR reaction which 
constituted a partial rat persephin cDNA sequence of 
approximately 350 nucleotides (SEQ ID NO: 106). The 
predicted amino acid sequence of this cDNA was compared 
15 to that of GDNF and neurturin, and found to have 

approximately 40% identity with each of these proteins. 
Importantly, the characteristic spacing of the canonical 
framework cysteine residues in members of the TGF-p 
super family was present. Furthermore, in addition to the 
20 region of similarity encoded by the degenerate primers 
used to isolate persephin, another region of high 
homology shared between GDNF and neurturin, but absent in 
other members of the TGF-p super family, was also present 
in persephin 

25 

GDNF ACCRPVAFDDDLSFLDD ( aa 60-76) (SEQ ID NO: 98) 
NTN PCCRPTAYEDEVSFKDV ( aa 61-77) (SEQ ID N0:99) 

PSP PCCQPTSYAD-VTFLDD ( aa 57-72) (SEQ ID NO: 100) 

30 (Amino acid numbering uses the first Cys residue as amino 
acid 1). 

With the confirmation that persephin was indeed a 
new member of the GDNF/NTN subfamily, we isolated murine 
35 genomic clones of persephin to obtain additional sequence 
information. Primers corresponding to rat cDNA sequence 
were used in a PGR reaction to amplify a 155 nucleotide 
(nt) fragment from mouse genomic DNA which was homologous 
to the rat persephin cDNA sequence. These primers were 
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then used to obtain murine persephin genomic clones from 
a mouse 129/Sv library in a PI bacteriophage vector 
(library screening service of Genome Systems, Inc., St. 
Louis, MO ) . 

5 Restriction fragments (3.4 kb Nco I and a 3.3 kb Bam 

HI ) from this PI clone containing the persephin gene were 
identified by hybridization with a 210 nt fragment of 
persephin obtained by PGR using mouse genomic DNA and 
persephin- specific primers. The Nco I and Bam HI 
10 fragments were sequenced and found to encode a stretch of 
amino acids corresponding to that present in the rat 
persephin RACE product, as well as being homologous to 
the mature regions of both neurturin and GDNF (Figure 
11). 

15 Human persephin was obtained in a manner similar to 

that of mouse persephin. Degenerate PGR primers were 
used to amplify human genomic DNA and one clone was 
determined to have a sequence homologous to mouse 
persephin. Primers based upon the identified sequence 

20 were then used to screen cDNA libraries. Positive clones 
were identified by hybridization with a DNA probe derived 
from the identified sequence and these clones were then 
sequenced . 

When the amino acid sequences of mature murine GDNF, 
25 NTN and PSP are aligned using the first canonical 

framework cysteine as the starting point, which is done 
because alterations in the cleavage sites between family 
members creates variability in the segments upstream of 
the first cysteine, persephin (91 amino acids) is 
30 somewhat smaller than either neurturin (95 amino acids) 
or GDNF (94 amino acids). The overall identity within 
this region is about 50% with neurturin and about 40% 
with GDNF (Figure 12 

Further nucleotide sequencing of the murine 
35 persephin Ncol fragment revealed the nucleotide sequence 
of the entire murine persephin gene as shown in Figure 
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17. In addition, the entire rat persephin gene has been 
determined by sequencing a PGR amplified fragment of rat 
genomic DNA as shown in Figure 18. In both the murine 
and rat persephin gene, an open reading frame extends 
5 from the sequence coding for an initiator methionine up 
to a stop codon at positions 244-246. However, somewhere 
in this sequence an apparent anomaly was found to occur 
such that the sequence encoding the RXXR cleavage site 
(positions 257-268) and the sequence corresponding to the 

10 mature persephin protein (positions 269-556) are not co- 
linear with this open reading frame. Instead, a second 
reading frame encodes the cleavage site and the mature 
persephin. Irrespective of this apparent anomaly, 
mammalian cells were found to express persephin from 

15 either the murine or rat full length genomic sequence 
(see Example 14 below). 

To pursue the genesis of this anomaly, we prepared 
mammalian expression vectors for both murine and rat 
persephin. To construct the murine plasmid, a PI clone 

20 containing the murine persephin gene was used as a 

template in a PGR assay. Primers were designed such that 
the resulting fragment would contain the persephin gene 
extending from the initiator Methionine to the stop 
codon. The PGR reaction utilized a forward primer M3175 

25 [5 ' -TGGTGTGAGGATGGGTGGAGGAAGAGTTGGGA] and reverse primer 
M3 1 5 6 [ 5 • -CGGTACCCAGATCTTCAGCCACCAC AGCCACAAGC ] . To 
construct the analogous rat plasmid, rat genomic DNA was 
used as a template in a PGR assay. The PGR reaction 
utilized a forward primer M3175 [5'- 

30 TGCTGTCACCATGGCTGCAGGAAGACTTCGGA] and reverse primer 
M3 1 5 6 [ 5 ' - CGGT ACCCAGATCTTCAGCCACC ACAGCCACAAGC ] . The 
amplified products were cloned into BSKS and sequenced to 
verify that the correct clone had been obtained. The rat 
and murine persephin fragments were excised using Sma I 

35 and Hind III and cloned into a Asp718 (blunted) and Hind 
III sites of the mammalian expression vector pCB6. 
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COS monkey cells were transfected with either the 
rat or murine persephin expression vectors or the non- 
recombinant vector (pCB6) itself. Forty eight hr later 
the cells were lysed, the samples were loaded onto a 15% 
5 SDS-polyacrylamide gel, and the proteins were separated 
by electrophoresis. The proteins were then transferred 
to nitrocellulose by electroblotting • This 
nitrocellulose membrane was incubated with anti-persephin 
antibodies (which we raised to mature persephin produced 
10 in bacteria from a pET plasmid) to detect the presence of 
persephin in the lysates. Lysates from cells transfected 
with either the rat or murine persephin expression 
vectors, but not the lysate from cells transfected with 
pCB6, contain high amounts of persephin. The size of the 
15 persephin detected was 10-15 kD, consistent with the size 
predicted for the processed (i.e. mature form of 
persephin). Conditioned media harvested from these cells 
also contained mature persephin. These results 
demonstrate that both the murine and rat persephin genes 
20 are capable of directing the synthesis of a properly 
processed persephin molecule. 

To pursue the mechanism by which this occurred, we 
isolated RNA from cells transfected with either rat or 
murine persephin expression vector. RT/PCR analysis was 
25 performed using primers corresponding to the initiator 

Met and the stop codon. We detected two fragments: one 
corresponding to the predicted size of the persephin gene 
and the other somewhat smaller, suggesting that RNA 
splicing had occurred. We confirmed this with a number 
30 of other primer pairs. Both the large and small 

persephin fragments were cloned and sequenced. As 
expected, the larger fragment corresponded to the 
persephin gene. The small fragment corresponded to a 
spliced version of persephin. A small 88 nt intron 
35 within the pro-domain (situated 154 nt downstream of the 
start codon) had been spliced out. After this splicing 
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event, the " f rameshif t" was no longer present (i.e. the 
initiator Met and the mature region are in- frame) in 
either rat or mouse persephin. 

The N- terminus of persephin was predicted by 
5 reference to the N- terminal regions of neurturin or GDNF. 
Using neurturin sequence homology and cleavage signals, a 
characteristic RXXR cleavage motif is present beginning 9 
residues upstream of the first canonical framework 
cysteine of persephin which would suggest that mature 
10 murine persephin would contain 5 amino acids (ALAGS) ( SEQ 
ID NO: 103) upstream of this cysteine (as does neurturin). 
The corresponding 5 amino acids in rat persephin are 
ALPGL (SEQ ID NO: 112) and those in human persephin are 
ALSGP (SEQ ID NO: 224). Using these parameters, mature 
15 persephin would consist of 96 amino acids and have a 
predicted molecular mass of 10.4 kD. 

By "mature" growth factor reference is made to the 
secreted form of the growth factor in which any pre- or 
pro- regions have been cleaved and which may exist as a 
20 monomer or, by analogy to other members of the TGF-B 
super family, in the form of a homodimer linked by 
disulfide bonds • 

The discovery of the new growth factor, persephin, 
as described above is a result of the prior discovery by 
25 the inventors herein of neurturin. Thus, the experiments 
leading to the discovery of neurturin are relevant to the 
current discovery of persephin as well as to the 
biological activity of persephin. The isolation, 
identification and characterization of neurturin is 
30 described in detail in Examples 1-5 below* 

Reference to persephin herein is intended to be 
construed to include growth factors of any origin which 
are substantially homologous to and which are 
biologically equivalent to the persephin characterized 
35 and described herein. Such substantially homologous 
growth factors may be native to any tissue or species 
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and, similarly, biological activity can be characterized 
in any of a number of biological assay systems. 
Reference to pre-pro persephin is intended to be 
construed to include pre-pro growth factors containing a 
5 pre- or leader or signal sequence region, a pro- sequence 
region and persephin as defined herein. 

The terms "biologically equivalent" are intended to 
mean that the compositions of the present invention are 
capable of demonstrating some or all of the same growth 
10 promoting properties in a similar fashion, although not 
necessarily to the same degree as the recombinantly 
produced human, mouse or rat persephin as identified 
herein . 

By "substantially homologous" it is meant that the 

15 degree of sequence identity of persephin orthologs 
including human, mouse and rat persephin as well as 
persephin from any other species, is greater than that 
between paralogs such as persephin and neurturin or 
persephin and GDNF, and greater than that reported 

20 previously for members of the TGF-6 superfamily (For 

discussion of homology of TGF-S superfamily members see 
Kingsley, Genes and Dev 8:133-46, 1994 which is 
incorporated by reference). 

Sequence identity or percent identity is intended to 

25 mean the percentage of same residues between two 

sequences. The reference sequence is human persephin 
when determining percent identity with mouse or rat 
persephin and with the non-persephin growth factors, 
human neurturin or human GDNF. The reference sequence is 

30 mouse persephin when determining percent identity with 
mouse GDNF and mouse neurturin and rat persephin when 
determining percent identity with rat GDNF and rat 
neurturin. Referencing is to human neurturin when 
determining percent identity with non-human neurturin and 
35 to human GDNF. In all of the above comparisons, the two 
sequences being compared are aligned using the Clustal 
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method (Higgins et al, Cahlos 8:189-191, 1992) of 
multiple sequence alignment in the Lasergene biocomputing 
software (DNASTAR, INC, Madison, WI ) . In this method, 
multiple alignments are carried out in a progressive 
5 manner, in which larger and larger alignment groups are 
assembled using similarity scores calculated from a 
series of pairwise alignments. Optimal sequence 
alignments are obtained by finding the maximum alignment 
score, which is the average of all scores between the 
10 separate residues in the alignment, determined from a 
residue weight table representing the probability of a 
given amino acid change occurring in two related 
proteins over a given evolutionary interval. Penalties 
for opening and lengthening gaps in the alignment 
15 contribute to the score. The default parameters used 
with this program are as follows: gap penalty for 
multiple alignment = 10; gap length penalty for multiple 
alignment = 10; k- tuple value in pairwise alignment = 1; 
gap penalty in pairwise alignment = 3; window value in 
20 pairwise alignment « 5; diagonals saved in pairwise 

alignment = 5- The residue weight table used for the 
alignment program is PAM250 (Dayhoff et al,, in Atlas of 
Protein Sequence and Structure, Dayhoff, Ed., NBRF, 
Washington, Vol. 5, suppl. 3, p. 345, 1978). 
25 Percent conservation is calculated from the above 

alignment by adding the percentage of identical residues 
to the percentage of positions at which the two residues 
represent a conservative substitution (defined as having 
a log odds value of greater than or equal to 0-3 in the 
30 PAM250 residue weight table)- Using this criterion, 

preferred conservative amino acid changes are: R-K; E-D, 
Y_F, L-M; V-I, Q-H. Conservation is referenced to human 
persephin when determining percent conservation with 
persephin from other species or with non-persephin growth 
35 factors; referenced to human neurturin when determining 
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15 



percent conservation with non-human neurturin or with 
non-persephin, non-neurturin growth factors. 

Table 1 shows the calculations of identity ( I ) and 
conservation (C) for comparisons of mature persephin, 
mature neurturin and mature GDNF from various species. 
Comparisons were made between mature human persephin 
(hPSP) and mature mouse and rat persephin (mPSP and rPSP, 
respectively ) and between mature human persephin and 
mature human GDNF or neurturin ( hGDNF and hNTN 
respectively). Neurturin comparisons were between mature 
human and mature mouse neurturin (hNTN and mNTN, 
respectively) and between each of these and mature human, 
rat and mouse GDNF (hGDNF, rGDNF and mGDNF, respectively) 
as shown in the table- 

Table 1 . 



COMPARISON 



% IDENTITY % CONSERVATION 



20 



hPSP V. 
hPSP V. 
mPSP V. 
hPSP V. 
hPSP V. 



mPSP 
rPSP 
rPSP 
hNTN 
hGDNF 



81 
80 
94 
49 
40 



81 
81 
96 
50 
43 



25 



30 



hNTN V. 
hNTN V. 
hNTN V. 
hNTN V. 
mNTN V. 
mNTN V. 
mNTN V. 



mNTN 

rGDNF 

mGDNF 

hGDNF 

rGDNF 

mGDNF 

hGDNF 



90 
44 
43 
43 
42 
41 
41 



93 
53 
52 
53 
52 
51 
52 



35 The degree of homology between the human persephin 

and mouse or rat persephin is about 80% whereas the 
degree of homology between mouse and rat persephin is 
about 94%. The neurturin comparisons as shown in Table 1 
indicate mature mouse and human neurturin proteins have 

40 about 90% sequence identity. Furthermore, all persephin 
and neurturin homologs of non-human mammalian species are 
believed to similarly have at least about 75% sequence 
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identity with human persephin, human neurturin, or human 
GDNF. For non-mammalian species such as avian species, 
it is believed that the degree of homology with persephin 
is at least about 65% identity with human persephin, or 
5 neurturin human neurturin or human GDNF. By way of 

comparison, the variations between family members of the 
neurturin-persephin-GDNF family of growth factors can be 
seen by the comparison of persephin and GDNF or neurturin 
and GDNF. Human persephin has about 40% sequence 
10 identity and about 43% sequence conservation with human 
GDNF; and about 49% sequence identity and about 50% 
sequence conservation with human neurturin. Similarly, 
human neurturin has about 40% sequence identity and about 
50% sequence conservation with human GDNF. It is 
15 believed that the different family members also have a 
similar sequence identity of about 40% of that of 
neurturin, about 40% of that of persephin or about 40% of 
that of GDNF and within a range of about 30% to about 75% 
identity with neurturin, within a range of about 30% to 
20 about 75% identity with persephin or within a range of 
about 30% to about 75% sequence identity with GDNF. 

Thus, a given member of the GDNF-neurturin-persephin 
family would be expected to have lesser sequence identity 
with any other family member of the same species than is 
25 present in orthologs of that family member in other 

species just as human GDNF and human neurturin are more 
closely related to mouse GDNF and mouse neurturin, 
respectively, than to each other or to GDNF and any given 
family member would be expected to have greater sequence 
30 Identity with another family member than to any other 

known member of the TGF-6 superfamily (Kingsley, supra). 

Homologs of pre-pro persephin in non-human mammalian 
species can be identified by virtue of the persephin 
portion of the amino acid sequence having at least about 
35 75% sequence identity with human persephin and homologs 
of pre-pro persephin in non-mammalian species can be 
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identified by virtue of the persephin portion of the 
amino acid sequence having at least about 65% identity 
with human persephin* 

Persephin as used herein, can also include hybrid 
5 and modified forms of persephin, respectively, including 
fusion proteins and persephin fragments and hybrid and 
modified forms in which certain amino acids have been 
deleted or replaced and modifications such as where one 
or more amino acids have been changed to a modified amino 
10 acid or unusual amino acid and modifications such as 
glycosolations so long as the hybrid or modified form 
retains the biological activity of persephin. By 
retaining the biological activity, it is meant that 
neuronal survival is promoted, although not necessarily 
15 at the same level of potency as that of the human, mouse 
or rat persephin identified herein* 

Also included within the meaning of substantially 
homologous is any persephin which may be isolated by 
virtue of cross-reactivity with antibodies to the 
20 persephin or whose encoding nucleotide sequences 

including genomic DNA, mRNA or cDNA may be isolated 
through hybridization with the complementary sequence of 
genomic or subgenomic nucleotide sequences or cDNA of the 
persephin or fragments thereof. It will also be 
25 appreciated by one skilled in the art that degenerate DNA 
sequences can encode human persephin and these are also 
intended to be included within the present invention as 
are allelic variants of persephin. 

Conservatively substituted persephin proteins are 
30 also within the scope of the present invention. 

Conservative amino acid substitutions refer to the 
interchangeability of residues having similar side 
chains. Conservatively substituted amino acids can be 
grouped according to the chemical properties of their 
35 side chains. For example, one grouping of amino acids 
includes those amino acids have neutral and hydrophobic 
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side chains (A, V, L, I, P, F, and M); anot:her 

grouping is those amino acids having neutral and polar 
side chains (G, S, T, Y, C, N, and Q); another grouping 
is those amino acids having basic side chains (K, R, and 
5 H); another grouping is those amino acids having acidic 
side chains ( D and E ) ; another grouping is those amino 
acids having aliphatic side chains {G, A, V, L, and I); 
another grouping is those amino acids having aliphatic- 
hydroxyl side chains (S and T); another grouping is those 
10 amino acids having amine -containing side chains (N, Q, K, 
R, and H); another grouping is those amino acids having 
aromatic side chains (F, Y, and W); and another grouping 
is those amino acids having sulfur-containing side chains 
(C and M). Preferred conservative amino acid 
15 substitutions groups are: R-K; E-D, Y-F, L-M; V-I, and Q- 
H. In addition, Q-R-H and A-V are believed to be 
preferred substitutions for persephin inasmuch as they 
occur among the human, mouse and rat paralogs of 
persephin ( see Figure 15B ) . 
20 In the case of pre-pro neurturin, alternatively 

spliced protein products, resulting from an intron 
located in the coding sequence of the pro region, may 
exist. The intron is believed to exist in the genomic 
sequence at a position corresponding to that between 
25 nucleic acids 169 and 170 of the cDNA which, in turn, 
corresponds to a position within amino acid 57 in both 
the mouse and human pre-pro neurturin sequences ( see 
Figures 7 and 8)- Thus, alternative splicing at this 
position might produce a sequence that differs from that 
30 identified herein for human and mouse pre-pro neurturin 
(SEQ ID NO: 11 and SEQ ID NO: 12, respectively) at the 
identified amino acid site by addition and/or deletion of 
one or more amino acids. Any and all alternatively 
spliced pre-pro neurturin proteins are intended to be 
35 included within the terms pre-pro neurturin and, 

similarly, any and all alternatively spliced pre-pro 
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persephin proteins are also intended to be included 
within the terms pre--pro persephin as used herein. 

Although it is not intended that the inventors 
herein be bound by any theory, it is thought that the 
5 human, mouse and rat persephin proteins identified herein 
as well as homologs from other tissues and species may 
exist as dimers in their biologically active form in a 
manner consistent with what is known for other factors of 
the TGF-6 superfamily. 
10 In addition to homodimers, the monomeric units of 

the dimers of persephin can be used to construct stable 
growth factor heterodimers or he ter omul timers comprising 
at least one monomer unit derived from persephin. This 
can be done by dissociating a homodimer of persephin into 
15 its component monomeric units and reassociating in the 
presence of a monomeric unit of a second or subsequent 
homodimeric growth factor. This second or subsequent 
homodimer ic growth factor can be selected from a variety 
of growth factors including neurturin, GDNF, a member of 
20 the NGF family such as NOP, BDNF, NT- 3 and NT-4/5, a 

member of the TGF-6 superfamily, a vascular endothelial 
growth factor, a member of the CNTF/LIF family and the 
like. 

Growth factors are thought to act at specific 
25 receptors. For example, the receptors for TGF-B and 
activins have been identified and make up a family of 
Ser/Thr kinase transmembrane proteins (Kingsley, Genes 
and Dev 8:133-146, 1994; Bexk et al Nature 373:339-341, 
1995 which are incorporated by reference). In the NGF 
30 family, NGF binds to the TrkA receptor in peripheral 

sensory and sympathetic neurons and in basal forebrain 
neurons; BDNF and NT-4/5 bind to trkB receptors; and NT-3 
binds primarily to trkC receptors that possess a distinct 
distribution within the CNS (Tuszynski et al-, Ann Neurol 
35 35:S9-S12, 1994). Members of the persephin-neurturin- 

GDNF family also appear to act through specific receptors 
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having distinct distributions as has been shown for other 
growth factor families • Recently, it was shown that GDNF 
acts through a multicomponent receptor complex in which a 
transmembrane signal transducing component, the Ret 
5 tyrosine kinase protein (Ret PTK), is activated upon the 
binding of GDNF with another protein, called GDNF 
Receptor a (GDNFR-a) which has no transmembrane domain 
and is attached to the cell surface via a glycosyl- 
phosphatidylinositol (GPI) linkage (Durbec et al . , Nature 
10 381:789-793, 1996; Jing et al . , Cell 85:1113-1124, 1996; 
Treanor et al.. Nature 382:80-83, 1996; Trupp et al.. 
Nature 381:785-789, 1996, which are incorporated herein 
by reference). Furthermore, it has been shown that the 
signaling of neurturin and GDNF through the Ret tyrosine 
15 kinase receptor is mediated by a family of co- receptors, 
including the co-receptor protein GDNFR-a, also referred 
to as TrnRl, and the co-receptor protein, TrnR2, either 
of which can form a functional receptor complex with Ret 
for both neurturin and GDNF (Baloh et al . , Neuron 18:793- 
20 802, 1997 which is incorporated by reference). By 

forming heterodimers or heteromultimers of persephin and 
one or more other growth factors, the resultant growth 
factor would be expected to be able to bind to at least 
two distinct receptor types preferentially having a 
25 different tissue distribution. The resultant 

heterodimers or heteromultimers would be expected to show 
a different and, possibly, an enlarged spectrum of cells 
upon which it could act or to provide greater potency. 
It is also possible that the heterodimer or 
30 heteromultimer might provide synergistic effects not seen 
with homodimers or homomultimers . For example, the 
combination of factors from different classes has been 
shown to promote long-term survival of oligodendrocytes 
whereas single factors or combinations of factors within 
35 the same class promoted short-term survival (Barres et 
al.. Development 118:283-295, 1993). 
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Heterodimers can be formed by a number of met:hods. 
For example, homodimers can be mixed and subjected to 
conditions in which dissociation/unfolding occurs, such 
as in the presence of a dissociation/unfolding agent, 
5 followed by subjection to conditions which allow monomer 
reassociation and formation of heterodimers . 
Dissociation/unfolding agents include any agent known to 
promote the dissociation of proteins* Such agents 
include, but are not limited to, guanidine hydrochloride, 
10 urea, potassium thiocyanate, pH lowering agents such as 
buffered HCl solutions, and polar, water miscible organic 
solvents such as acetonitrile or alcohols such as 
propanol or isopropanol. In addition, for homodimers 
linked covalently by disulfide bonds as is the case with 
15 TGF~B family members, reducing agents such as 

dithiothreitol and 6-mercaptoethanol can be used for 
dissociation/unfolding and for reassociation/ref olding . 

Heterodimers can also be made by transfecting a cell 
with two or more factors such that the transformed cell 
20 produces heterodimers as has been done with the 
neurotrophins . (Heymach and Schooter, 3 Biol Chem 
27 01122.91 --1230 A, 1995) . 

Another method of forming heterodimers is by 
combining persephin homodimers and a homodimer from a 
25 second growth factor and incubating the mixture at 37 *C. 

When heterodimers are produced from homodimers, the 
heterodimers may then be separated from homodimers using 
methods available to those skilled in the art such as, 
for example, by elution from preparative, non-denaturing 
30 polyacrylamide gels. Alternatively, heterodimers may be 
purified using high pressure cation exchange 
chromatography such as with a Mono S cation exchange 
column or by sequential immunoaf f inity columns. 

It is well known in the art that many proteins are 
35 synthesized within a cell with a signal sequence at the 
N- terminus of the mature protein sequence and the protein 
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carrying such a leader sequence is referred to as a 
preprotein. The pre- portion of the protein is cleaved 
during cellular processing of the protein. In addition 
to a pre- leader sequence, many proteins contain a 
5 distinct pro sequence that describes a region on a 

protein that is a stable precursor of the mature protein. 
Proteins synthesized with both pre- and pro- regions are 
referred to as preproproteins . In view of the processing 
events known to occur with other TGF-B family members as 
10 well as the sequences determined herein, the inventors 
believe that the form of the persephin protein as 
synthesized within a cell is the pre-pro persephin. Human 
pre-pro persephin is believed to contain an N-terminal 23 
amino acid signal sequence (human pre- signal sequence, 
15 SEQ ID NO: 219, Figure 24, amino acids 1 through 23 

encoded by SEQ ID NOS:208 and 209, Figure 24, nucleic 
acids 1 through 69). It is known that the full length of 
a leader sequence is not necessarily required for the 
sequence to act as a signal sequence and, therefore, 
20 within the definition of pre- region of persephin is 

included fragments thereof, usually N-terminal fragments, 
that retain the property of being able to act as a signal 
sequence, that is to facilitate co-translational 
insertion into the membranes of one or more cellular 
25 organelles such as endoplasmic reticulum, mitochondria, 
golgi, plasma membrane and the like. 

The persephin signal sequence is followed by a pro- 
domain which contains an RXXR proteolytic processing site 
immediately before the N-terminal amino acid sequence for 
30 the mature persephin. (human pro- region sequence, SEQ ID 
NO: 220, Figure 24, amino acids 24 through 60 encoded by 
the nucleic acid sequence SEQ ID NO: 211, Figure 24 
nucleic acids 70 through 180). 

The persephin pre- and pro- regions together 
35 comprise a pre-pro sequence identified as the human pre- 
pro region sequence (SEQ ID NO: 219, Figure 24, amino 
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acids 1 through 60 encoded by SEQ ID N0S:213 and 215, 
nucleic acids 1 through 285). The pre- region sequences 
arid pro- region sequences as well as the pre-pro region 
sequences can be identified and obtained for non-human 
5 mammalian species and for non-mammalian species by virtue 
of the sequences being contained within the pre-pro 
persephin as defined herein. 

Using the above landmarks, the human persephin cDNA 
has a 471 bp open reading frame encoding a 156 amino acid 

10 long protein (predicted Mr 16.6kDA). Cleavage of the 23 
amino acid long predicted signal peptide will lead to a 
133 amino acid pro-persephin molecule (Mr 14,2 JcDa ) , 
Proteolytic cleavage of the pro-persephin at a RXXR 
consensus sequence should yield a 96 amino acid mature 

15 protein with a molecular weight of 10.3 kDa. The mature, 
secreted persephin molecule is likely to form a disulfide 
linked homodimer by analogy to other members of the TGF-6 
family. 

The nucleotide sequences of persephin pre- and/or 

20 pro- regions or similar regions that are believed to be 
associated with persephin DNA can be used to construct 
chimeric genes with the coding sequences of other growth 
factors or proteins. (Booth et al.. Gene 146:303-8, 
1994; Ibanez, Gene 146:303-8, 1994; Storici et al., FEES 

25 Letters 337:303-7, 1994; Sha et al J Cell Biol 114:827- 
839, 1991 which are incorporated by reference). Such 
chimeric proteins can exhibit altered production or 
expression of the active protein species. 

A preferred persephin according to the present 

30 invention is prepared by recombinant DNA technology 

although it is believed that persephin can be isolated in 
purified form from cell-conditioned medium as was done 
for neurturin. 

By "pure form" or "purified form" or "substantially 

35 purified form" it is meant that a persephin composition 
is substantially free of other proteins which are not 
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persephin. Preferably, a substantially purified 
persephin composition comprises at least about 50 percent 
persephin on a molar basis compared to total proteins or 
other macromolecular species present. More preferably, a 
5 substantially purified persephin composition will 

comprise at least about 80 to about 90 mole percent of 
the total protein or other macromolecular species present 
and still more preferably, at least about 95 mole percent 
or greater. 

10 Recombinant persephin may be made by expressing the 

DNA sequences encoding persephin in a suitable 
transformed host cell. Using methods well known in the 
art, the DNA encoding persephin may be linked to an 
expression vector, transformed into a host cell and 

15 conditions established that are suitable for expression 
of persephin by the transformed cell. 

Any suitable expression vector may be employed to 
produce recombinant human persephin such as, for example, 
the mammalian expression vector pCB6 (Brewer, Meth Cell 

20 Biol 43:233-245, 1994) or the E. coll pET expression 

vectors, specifically, pET-30a (Studier et al*. Methods 
Enzymol 185:60-89, 1990 which is incorporated by 
reference) both of which were used herein. Other 
suitable expression vectors for expression in mammalian 

25 and bacterial cells are known in the art as are 

expression vectors for use in yeast or insect cells. 
Baculovirus expression systems can also be employed. 

Persephin may be expressed in the monomeric units or 
such monomeric form may be produced by preparation under 

30 reducing conditions. In such instances refolding and 

renaturation can be accomplished using one of the agents 
noted above that is known to promote 

dissociation/association of proteins. For example, the 
monomeric form can be incubated with dithiothreitol 
35 followed by incubation with oxidized glutathione disodium 
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salt followed by incubation with a buffer containing a 
refolding agent such as urea, 

Persephin may exist as a dimer or other multimer and 
may be glycosylated or chemically modified in other ways. 

5 Mature human persephin contains no N- linked glycosylation 
sites (see Figure 15B and SEQ ID NO: 221). Potential O- 
linked glycosylation sites occur in mature human 
persephin at positions 3, 10, 12, 14, 24, 36, 43, 67, 70 
and 88 in SEQ ID NO: 221 (Figure 15B). 
10 As noted above, the human nucleic acid sequence 

suggests that persephin is initially translated as a pre- 
pro polypeptide and that proteolytic processing of the 
signal sequence and the "pro" portion of this molecule 
results in the mature sequence, referenced herein as 

15 "mature persephin" exists in human and in non-human 
species in homologous form. Therefore, persephin 
includes any and all "mature persephin" sequences from 
human and non-human species and any and all pre-pro 
persephin polypeptides that may be translated from the 

20 persephin gene. 

By analogy to the neurturin protein, it is possible 
that isoforms of persephin may exist. For example, 
different possible cleavage sites (such as RXXR sites) 
may be present in the pre-pro neurturin sequence so that 

25 more than one possible isoform of pre-pro neurturin may 
exist. Thus, the mature neurturin protein may have a 
variable number of amino acids preceding the first 
canonical cysteine. Such alternate cleavage sites could 
be utilized differently among different organisms and 

30 among different tissues of the same organism. The 

N- terminal amino acids preceding the first of the seven 
conserved cysteines in the mature forms of members of the 
TGF-6 family vary greatly in both length and sequence. 
Furthermore, insertion of a ten amino acid sequence two 

35 residues upstream of the first conserved cysteine does 

not affect the known biological activities of one family 
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member, dorsalin (Easier et al.. Cell 73:687-702, 1993). 
By analogy, it: is also possible that persephin proteins 
containing sequences of different lengths preceding the 
first canonical cysteine may exist or could be made and 
5 that these would retain their biological activity. 

The inventors herein believe that at a minimum the 
sequence of a persephin-neurturin-GDNF growth factor that 
will show biological activity will contain the sequence 
from the first through the seventh canonical cysteine. 
10 This sequence of human persephin is from cysteine 66 
through cysteine 154 as shown in Figure 24 (SEQ ID 
NO:223). The comparable sequence for murine persephin as 
shown in Figure 12 is from cysteine 1 through cysteine 87 
(SEQ ID NO: 79) and that for rat persephin as shown in 
15 Figure 14, from cysteine 1 through cysteine 87 

(identified as SEQ ID NO: 82). Thus, within the scope of 
persephin proteins of the present invention are amino 
acid sequences containing SEQ ID NO: 223, SEQ ID NO: 79 or 
SEQ ID NO: 82 and nucleic acid molecules containing 
20 sequences encoding these amino acid sequences. 

The present invention also encompasses nucleic acid 
molecules comprising sequences that encode mouse, rat and 
human persephin (Figures 11, 14 and 23). Also included 
within the scope of this invention are sequences that are 
25 substantially the same as the nucleic acid sequences 
encoding persephin. Such substantially the same 
sequences may, for example, be substituted with codons 
more readily expressed in a given host cell such as E. 
coll according to well known and standard procedures. 
30 Such modified nucleic acid sequences are included within 
the scope of this invention. 

Specific nucleic acid sequences can be modified by 
those skilled in the art and, thus, all nucleic acid 
sequences which encode for the amino acid sequences of 
35 pre-pro persephin or the pre- region or the pro- region 
of persephin can likewise be so modified. The present 
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invention also includes nucleic acid sequence having one 
or more substitutions, deletions or additions wherein the 
nucleic acid sequence will hybridize with a persephin 
nucleic acid sequences or complement thereof where 
5 appropriate. 

Specific hybridization is defined herein as the 
formation of hybrids between a polynucleotide (e.g. a 
persephin polynucleotide which may include one or more 
substitutions, deletions, and/or additions) and a 

10 specific reference polynucleotide (e.g. polynucleotides 
encoding mature persephin and having the sequences of SEQ 
ID NO: 183, 184, 194, 195, 199, 200, 201 or 202) wherein 
the polynucleotide preferentially hybridizes to the 
specific reference polynucleotide. For example, a 

15 polynucleotide encoding a mature persephin will 
specifically hybridize to a reference persephin 
polynucleotide (e.g. SEQ ID NO: 183, 184, 194, 195, 199, 
200, 201 or 202) and not to a reference neurturin 
polynucleotide (e.g. SEQ ID NO: 9 or 10 or sequences 

20 complementary thereto). Specific hybridization is 

preferably done under high stringency conditions which is 
well understood by those skilled in the art to be 
determined by a number of factors during hybridization 
and during the washing procedure, including temperature, 

25 ionic strength, length of time and concentration of 
formamide (see for example, Sambrook et al . , 1989, 
supra ) . 

The present invention also includes nucleic acid 
sequences which encode for polypeptides that have 
30 survival or growth promoting activity and that are 
recognized by antibodies that bind to persephin. 

The present invention also encompasses vectors 
comprising an expression regulatory element operably 
linked to any of the nucleic acid sequences included 
35 within the scope of the invention. This invention also 
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includes host cells of any variety that have been 
transformed with such vectors. 

Methods are also provided herein for producing 
persephin. Preparation can be by isolation from 
5 conditioned medium from a variety of cell types so long 
as the cell type produces persephin. A second and 
preferred method involves utilization of recombinant 
methods by isolating a nucleic acid sequence encoding 
persephin, cloning the sequence along with appropriate 
10 regulatory sequences into suitable vectors and cell 

types, and expressing the sequence to produce persephin. 

A mammalian gene family comprised of four 
neurotrophic factors has been identified including nerve 
growth factor (NGF), brain derived neurotrophic factor 
15 (BDGF), neurotrophin-3 (NT-3), and neurotrophin-4/5 (NT- 
4/5). These factors share approximately 60 percent 
nucleic acid sequence homology (Tuszynski and Gage, Ann 
Neurol 35 :S9-S12, 1994 which is incorporated by 
reference). The persephin protein and the neurturin 
20 protein display no significant homology to the NGF family 
of neurotrophic factors. Either persephin or neurturin 
shares less than about 20% homology with the TGF-fl 
super family of growth factors. However, both persephin 
and neurturin show approximately 40% sequence identity 
25 with GDNF and approximately 50% sequence identity with 
each other. In particular, the positions of the seven 
cysteine residues present in persephin, neurturin and 
GDNF are nearly exactly conserved. The inventors herein 
believe that other unidentified genes may exist that 
30 encode proteins that have substantial amino acid sequence 
homology to persephin, neurturin and GDNF and which 
function as growth factors selective for the same or 
different tissues and the same or different biological 
activities and may act at the same or different 
35 receptors. A different spectrum of activity with respect 
to tissues affected and/or response elicited could result 
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from pref erent:ial activation of different receptors by 
different family members as is known to occur with 
members of the NGF family of neurotrophic factors 
(Tuszynski and Gage, 1994, supra). 
5 As a consequence of members of a particular gene 

family showing substantial conservation of amino acid 
sequence among the protein products of the family 
members, there is considerable conservation of sequences 
at the DNA level. This forms the basis for a new 

10 approach for identifying other members of the gene family 
to which GDNF, neurturin and persephin belong. The 
method used for such identification is cross- 
hybridization using nucleic acid probes derived from one 
family member to form a stable hybrid duplex molecule 

15 with nucleic acid sequence from different members of the 
gene family or to amplify nucleic acid sequences from 
different family members, (see for example, Kaisho et al. 
FEES Letters 266:187-191, 1990 which is incorporated by 
reference). The sequence from the different family 

20 member may not be identical to the probe, but will, 
nevertheless be sufficiently related to the probe 
sequence to hybridize with the probe. Alternatively, PGR 
using primers from one family member can be used to 
identify additional family members. 

25 The above approaches have not heretofore been 

successful in identifying other gene family members 
because only one family member, GDNF was known. With the 
identification of neurturin in copending application 
Serial No. 08/519,777, however, unique new probes and 

30 primers were made that contain sequences from the 
conserved regions of this gene family. The same 
conserved regions are also found in the third family 
member, persephin. In particular, three conserved 
regions have been identified herein which can be used as 

35 a basis for constructing new probes and primers. The new 
probes and primers made available from the work with 
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neurturin and persephin make possible this powerful new 
approach which can now successfully identify other gene 
family members. Using this new approach, one may screen 
for genes related to GDNF, neurturin and persephin in 
5 sequence homology by preparing DNA or RNA probes based 
upon the conserved regions in the GDNF and neurturin 
molecules. Therefore, one embodiment of the present 
invention comprises probes and primers that are unique to 
or derived from a nucleotide sequence encoding such 
10 conserved regions and a method for identifying further 
members of the neurturin-persephin-GDNF gene family. 

Conserved -region amino acid sequences have been 
identified herein to include Val-Xaai-Xaa2-Leu-Gly~Leu- 
Gly-Tyr where Xaa^ is Ser, Thr or Ala and Xaa2 is Glu or 
15 Asp (SEQ ID NO: 108); Glu-Xaai-Xaa2-Xaa3-Phe-Arg-Tyr-Cys~ 

Xaa4-Gly-Xaa5-Cys in which Xaai is Thr, Glu or lys, Xaa2 is 
Val, Leu or lie, Xaag is Leu or lie, Xaa4 is Ala or Ser, 
and Xaag is Ala or Ser, (SEQ ID NO: 113); and Cys-Cys-Xaai- 
Pro-Xaa2-Xaa3-Xaa4-Xaa5-Asp-Xaa6-Xaa7-Xaae-Phe-Leu-Asp-Xaa9 
20 in which Xaa^ is Arg or Gin, Xaa2 is Thr or Val or lie, 
Xaaa is Ala or Ser, Xaa^ is Tyr or Phe, Xaas is Glu, Asp 
or Ala, Xaa^ is Glu, Asp or no amino acid, Xaa7 is val or 
leu, Xaae is Ser or Thr, and Xaa^ is Asp or Val (SEQ ID 
NO: 114). Nucleotide sequences containing a coding 
25 sequence for the above conserved sequences or fragments 
of the above conserved sequences can be used as probes. 
Exemplary probe and primer sequences encoding amino acid 
sequences and SEQ ID NOS: 125-129; primers whose reverse 
complementary sequences encode amino acid sequences SEQ 
30 ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 130; and, in 

particular, nucleotide sequences, SEQ ID NOS: 115-124. 
Additional primers based upon GDNF and neurturin include 
nucleic acid sequences encoding amino acid sequences, SEQ 
ID NO: 33, SEQ ID NO: 36, SEQ ID NO: 40 and SEQ ID NO: 41; 
35 primers whose reverse complementary sequences encode SEQ 
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ID NO: 37, SEQ ID NO: 38 and SEQ ID NO 39; and, in 
particular, nucleic acid sequences, SEQ ID NOS: 42-48. 

Hybridization using the new probes from conserved 
regions of the nucleic acid sequences would be performed 
5 under reduced stringency conditions. Factors involved in 
determining stringency conditions are well known in the 
art (for example, see Sambrook et al.. Molecular Cloning, 
2nd Ed., 1989 which is incorporated by reference). 
Sources of nucleic acid for screening would include 
10 genomic DNA libraries from mammalian species or cDNA 

libraries constructed using RNA obtained from mammalian 
cells cloned into any suitable vector. 

PGR primers would be utilized under PGR conditions 
of reduced annealing temperature which would allow 
15 amplification of sequences from gene family members other 
than GDNF, neurturin and persephin. Sources of nucleic 
acid for screening would include genomic DNA libraries 
from mammalian species cloned into any suitable vector, 
cDNA transcribed from RNA obtained from mammalian cells, 
20 and genomic DNA from mammalian species. 

DNA sequences identified on the basis of 
hybridization or PGR assays would be sequenced and 
compared to GDNF, neurturin and persephin. The DNA 
sequences encoding the entire sequence of the novel 
25 factor would then be obtained in the same manner as 

described herein. Genomic DNA or libraries of genomic 
clones can also be used as templates because the 
intron/exon structures of GDNF and neurturin are 
conserved and coding sequences of the mature proteins are 
30 not interrupted by introns. 

Using this approach as described above, the primers 
designed from the conserved regions of neurturin and GDNF 
have been used to identify and obtain the sequence of the 
new family member described herein, persephin. 
35 Degenerate primers designed from persephin, neurturin and 
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GDNF can be further used to identify and obtain 
additional family members. 

It is believed that all GDNF-neurturin-persephin 
family members will have a high degree of sequence 
5 identity with one or more of the three identified family- 
member consensus regions in the portion of the sequence 
between the first and seventh canonical framework 
cysteines (see Figure 12). In particular, a new family 
member is anticipated to have at least a 62.5% identity 
10 with the consensus region octapeptide, Val-Xaa^-Xaaj-Leu- 
Gly-Leu-Gly-Tyr where Xaa^ is Ser, Thr or Ala and Xaaj is 
Glu or Asp (SEQ ID NO: 108) or at least a 62.5 percent 
sequence identity with the consensus region octapeptide, 
Phe-Arg-Tyr-Cys-Xaai-Gly-Xaa2-Cys where Xaa^ and Xaa2 are 
15 alanine or serine (SEQ ID NO: 109) or at least a 50 
percent sequence identity with the consensus region 
octapeptide, Asp-Xaai-Xaa2-Xaa3-Phe~Leu-Asp-Xaa4 where Xaa^ 
is aspartic acid or glutamic acid or no amino acid, Xaaj 
is valine or leucine, Xaa3 is serine or threonine; and 
20 Xaa4 is valine or aspartic acid (SEQ ID NO: 110). The 

inventors herein believed that any new family member will 
have 28 amino acids in the aligned sequence between the 
first and seventh canonical framework cysteine residues 
as set forth in Figure 15 with residues numbered from the 
25 N-terminal end of the family member aligned sequence 
being (1) Cys, (3) Leu, (10) Val, (13) Leu, (14) Gly, 
(15) Leu, (16) Gly, (17) Tyr, (21) Glu, (25) Phe, (26) 
Arg, (27) Tyr, (28) Cys, (30) Gly, (32) Cys, (44) Leu, 
(47) Leu, (58) Cys, (59) Cys (61) Pro, (66) Asp, (69) 
30 Phe, (70) Leu, (71) Asp, (83) Ser, (84) Ala, (87) Cys, 
and (89) Cys, however, it is possible that there may be 
as many as three mismatches. 

On the basis of the structural similarities of 
persephin to the sequences of neurturin and GDNF, 
35 persephin would be expected to promote the survival and 
growth of neuronal as well as non-neuronal cells. For 
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example, neurturin has been shown to promote the survival 
of superior cervical ganglion cells as well as nodose 
sensory ganglia neurons (see Examples 1-3). Furthermore, 
GDNF has been shown to act on dopaminergic, sympathetic, 
5 motor and several sensory neurons ( Henderson et al • 

supra, 1994; Miles et al, J Cell Biol 130:137-148, 1995; 
Yan et al. Nature 373:341-344, 1995; Lin et al. Science 
260:1130-1132, 1993; Trupp et al, J Cell Biol 130:137- 
148, 1995; Martin et al Brain Res 683:172-178 , 1995; 

10 Bowenkamp st al J Comp Nexxrol 355:479-489, 1995 which are 
incorporated by reference). Moreover, all other growth 
factors isolated to date have been shown to act on many 
different cell types (for example see Scully and Otten, 
Cell Biol int 19:459-469, 1005; Hefti, Neurotrophic 

15 Factor Therapy 25:1418-1435, 1994 which are incorporated 
by reference). Thus, it is likely that persephin will 
show activity on a variety of different neuronal cells, 
both peripheral and central, as well as on non-neuronal 
cells. With respect to peripheral neuronal cells, the 

20 profile of cells for which persephin will show a survival 
promoting activity appears to be different from that of 
neurturin or GDNF. In contrast to the survival-promoting 
activity produced by neurturin and GDNF in sympathetic 
and sensory neurons, persephin showed no activity in 

25 these tissues at concentrations tested. Nevertheless, 
persephin showed survival -promoting activity in 
mesencephalic cells obtained from rat embryo brains. 
Furthermore, persephin activity on any particular target 
cell type can be determined by routine experimentation 

30 using standard reference models. Moreover, the inventors 
herein have identified brain and heart tissues as tissues 
expressing persephin, which further supports the 
conclusion that persephin can act to promote survival and 
growth in a variety of neuronal and non-neuronal cells. 

35 As an example of the actions of neurotrophic factors 

on non-neuronal tissues, the prototypical neurotrophic 
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factor, NGF, also acts upon mast cells to increase their 
number when injected into newborn rats (Aloe, J 
N euro Immunol 18:1-12, 1988). In addition, mast cells 
express the trk receptor and respond to NGF such that NGF 
5 is a mast cell secretogogue and survival promoting factor 
(Horigome et al-, J Biol Chem 269:2695-2707, 1994 which 
is incorporated by reference). Moreover, members of the 
TGF-B superfamily act on many cell types of different 
function and embryologic origin. 

10 The inventors herein have identified brain and heart 

as tissues in which persephin is expressed and it is 
further believed that persephin is expressed in a number 
of other neuronal and non-neuronal tissues. The related 
family member, neurturin, is expressed in a number of 

15 non-neuronal tissues including blood, bone marrow, 

neonatal liver and mast cells. This suggests a role for 
neurturin in hematopoiesis , inflammation, allergy, and 
cardiomyopathy. Similarly, persephin may also have a 
similar profile of activity- 

20 Neurotrophic factors of the NGF family are thought 

to act through factor- specif ic high affinity receptors 
(Tuszynski and Gage, 1994, supra). Only particular 
portions of the protein acting at a receptor site are 
required for binding to the receptor. Such particular 

25 portions or discrete fragments can serve as an agonist 
where the substance activates a persephin receptor to 
elicit the promoting action on cell survival and growth 
and antagonists to persephin where they bind to, but do 
not activate, the receptor or promote survival and 

30 growth. Such portions or fragments that are agonists and 
those that are antagonists are also within the scope of 
the present invention. 

Synthetic, pan-growth factors can also be 
constructed by combining the active domains of persephin 

35 with the active domains of one or more other non- 

persephin growth factors. (For example, see Hag et al.. 
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Proc Nat'l Acad Sci 92:607-611, 1995 which is 
incorporated by reference). These pan-growth factors 
would be expected to have the combined activities or 
other advantageous properties of persephin and the one or 
5 more other growth factors. As such, these pan-growth 
factors are believed to be potent and multispecif ic 
growth factors that are useful in the treatment of a wide 
spectrum of degenerative diseases and conditions 
including conditions that can be treated by any or all of 

10 the parent factors from which the active domains were 
obtained. Such pan-growth factors might also provide 
synergistic effects beyond the activities of the parent 
factors ( Barres et al . , supra ) . 

Pan-growth factors within the scope of the present 

15 invention can include chimeric or hybrid polypeptides 
that are constructed from portions of fragments of at 
least two growth factors. Growth factors of the TGF-6 
superfamily are structurally related having highly 
conserved sequence landmarks whereby family members are 

20 identified. In particular, seven canonical framework 

cysteine residues are nearly invariant in members of the 
superfamily (Kingsley, Genes & Dev 8:133-146, 1994 which 
is incorporated by reference )( see Figure 17). Chimeric 
polypeptide molecules can, therefore, be constructed from 

25 a sequence that is substantially identical to a portion 
of the persephin molecule, up to one or more crossover 
points, and one or more sequences each of which is 
substantially identical with a portion of another TGF-B 
superfamily member extending on the other side of the 

30 corresponding one or more crossover points. For example, 
a portion of the amino terminal end of the persephin 
polypeptide can be combined with a portion of the carboxy 
terminal end of a neurturin polypeptide or alternatively 
a portion of the amino terminal end of a neurturin 

35 polypeptide can be combined with a portion of the carboxy 
terminal end of a persephin polypeptide. Such portions 
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of persephin or neurt:urin polypeptides are preferably 
from about 5 to about 95, more preferably from about 10 
to about 90, still more preferably from about 20 to about 
80 and most preferably from about 30 to about 70 
5 contiguous amino acids and such portions of another, non- 
persephin or, as the case may be, non-neurturin TGF-B 
super family member are preferably from about 5 to about 
95, more preferably from about 10 to about 90, still more 
preferably from about 20 to about 80 and most preferably 
10 from about 30 to about 70 contiguous amino acids. For 
example, a particular crossover point might be between 
the third and fourth canonical framework cysteine 
residues. One such exemplary construct would contain at 
the 5 ' end a sequence comprised of a persephin sequence 
15 from residue 1 through the third canonical framework 
cysteine residue 37 and up to a cross -over point 
somewhere between residue 37 and residue 63 but not 
including the fourth canonical framework cysteine residue 
64 (for reference, see mature persephin, SEQ ID NO: 80). 
20 The 3' end of the hybrid construct would constitute a 
sequence derived from another TGF-B superfamily member 
such as, for example, neurturin which is another TGF-B 
superfamily member that is closely related to persephin. 
Using neurturin as the other TGF-6 family member, the 
25 hybrid construct beyond the crossover point would be 
comprised of a sequence beginning at the desired 
crossover point in the neurturin sequence between the 
third canonical framework cysteine residue 37 and the 
fourth canonical framework cysteine residue 67 of 
30 neurturin and continuing through residue 100 at the 3 ' 
end of neurturin (for alignment, see figure 12). A 
second exemplary hybrid construct would be comprised of 
residue 1 through a crossover point between residues 37 
and 67 of neurturin contiguously linked with residues 
35 from the crossover point between residues 37 and 64 

through residue 96 of persephin. The above constructs 
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with persephin and neurturin are intended as examples 
only with the particular TGF-B family member being 
selected from family members including but not limited to 
transforming growth factor-Bl ( TGF61 ) , transforming 
5 growth factor-B2 (TGFB2), transforming growth factor-B3 
(TGFB3), inhibin B A (INHBA), inhibin B B (INHBB), the 
nodal gene { NODAL ) , bone morphogenetic proteins 2 and 4 
(BMP2 and BMP4 ) , the Drosophtla decapentapleglc gene 
(dpp), bone morphogenetic proteins 5-8 (BMP5, BMP6, BMP7 
10 and BMP8), the Drosophlla 60A gene family (60A), bone 
morphogenetic protein 3 ( BMP3 ) , the Vgl gene, growth 
differentiation factors 1 and 3 (GDFl and GDF3 ) , dorsalin 
(drsln), inhibin a (INHa), the MIS gene (MIS), growth 
factor 9 (GDF-9), glial-derived neurotropic growth factor 
15 (GDNF), neurturin (NTN) and persephin (see Figure 16). 
In addition, the crossover point can be any residue 
between the first and seventh canonical framework 
cysteines molecules of neurturin and the particular other 
family member. Furthermore, additional crossover points 
20 can be used to incorporate any desired number of 
persephin portions or fragments with portions or 
fragments of any one or more other family members. 

In constructing a particular chimeric molecule, the 
portions of persephin and portions of the other, non- 
25 persephin growth factor are amplified using PGR, mixed 

and used as template for a PGR reaction using the forward 
primer from one and the reverse primer from the other of 
the two component portions of the chimeric molecule. 
Thus, for example a forward and reverse primers are 
30 selected to amplify the portion of persephin from the 
beginning to the selected crossover point between the 
third and fourth canonical cysteine residues using a 
persephin plasmid as template. A forward primer with a 
5 ' portion overlapping with the persephin sequence and a 
35 reverse primer are then used to amplify the portion of 

the other, non-persephin growth factor member of the TGF- 
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B superfamily from the corresponding crossover point 
through the 3 ' end using a plasmid template containing 
the coding sequence for the non-persephin TGF-B family 
member. The products of the two PGR reactions are gel 
5 purified and mixed together and a PGR reaction performed. 
Using an aliquot of this reaction as template a PGR 
reaction is performed using the persephin forward primer 
and the reverse primer for the non-persephin growth 
factor. The product is then cloned into an expression 

10 vector for production of the chimeric molecule. 

Chimeric growth factors would be expected to be 
effective in promoting the growth and development of 
cells and for use in preventing the atrophy, degeneration 
or death of cells, particular in neurons. The chimeric 

15 polypeptides may also act as receptor antagonists of one 
or both of the full length growth factors from which the 
chimeric polypeptide was constructed or as an antagonist 
of any other growth factor that acts at the same receptor 
or receptors . 

20 The present invention also includes therapeutic or 

pharmaceutical compositions comprising persephin in an 
effective amount for treating patients with cellular 
degeneration or dysfunction and a method comprising 
administering a therapeutically effective amount of 

25 persephin. These compositions and methods are useful for 
treating a number of degenerative diseases. Where the 
cellular degeneration involves neuronal degeneration, the 
diseases include, but are not limited to peripheral 
neuropathy, amyotrophic lateral sclerosis, Alzheimer's 

30 disease, Parkinson's disease, Huntington's disease, 

ischemic stroke, acute brain injury, acute spinal chord 
injury, nervous system tumors, multiple sclerosis, 
peripheral nerve trauma or injury, exposure to 
neurotoxins, metabolic diseases such as diabetes or renal 
35 dysfunctions and damage caused by infectious agents. In 
particular, the ability of persephin to promote survival 
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in mesencephalic cells suggests an applicability of this 
growth factor in treating neuronal degenerative diseases 
of the CNS such as Parkinson's disease. 

Where the cellular degeneration involves bone marrow 
5 cell degeneration, the diseases include, but are not 

limited to disorders of insufficient blood cells such as, 
for example, leukopenias including eosinopenia and/or 
basopenla, lymphopenia, monocytopenia, neutropenia, 
anemias, thrombocytopenia as well as an insufficiency of 

10 stem cells for any of the above. The cellular 

degeneration can also involve myocardial muscle cells in 
diseases such as cardiomyopathy and congestive heart 
failure. The above cells and tissues can also be treated 
for depressed function. 

15 The compositions and methods herein can also be 

useful to prevent degeneration and/or promote survival in 
other non-neuronal tissues as well. One skilled in the 
art can readily determine using a variety of assays known 
in the art for identifying whether persephin would be 

20 useful in promoting survival or functioning in a 
particular cell type. 

In certain circumstances, it may be desirable to 
modulate or decrease the amount of persephin expressed. 
Thus, in another aspect of the present invention, 

25 persephin ant i -sense oligonucleotides can be made and a 
method utilized for diminishing the level of expression 
of persephin, respectively, by a cell comprising 
administering one or more persephin anti-sense 
oligonucleotides. By persephin anti-sense 

30 oligonucleotides reference is made to oligonucleotides 
that have a nucleotide sequence that interacts through 
base pairing with a specific complementary nucleic acid 
sequence involved in the expression of persephin such 
that the expression of persephin is reduced. Preferably, 
35 the specific nucleic acid sequence involved in the 

expression of persephin is a genomic DNA molecule or mRNA 
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molecule that contains sequences of the persephin gene. 
This genomic DNA molecule can comprise flanking regions 
of the persephin gene, untranslated regions of persephin 
mRNA, the pre- or pro- portions of the persephin gene or 
5 the coding sequence for mature persephin protein. The 
term complementary to a nucleotide sequence in the 
context of persephin antisense oligonucleotides and 
methods therefor means sufficiently complementary to such 
a sequence as to allow hybridization to that sequence in 
10 a cell, i.e., under physiological conditions. The 

persephin antisense oligonucleotides preferably comprise 
a sequence containing from about 8 to about 100 
nucleotides and more preferably the persephin antisense 
oligonucleotides comprise from about 15 to about 30 
15 nucleotides. The persephin antisense oligonucleotides 
can also contain a variety of modifications that confer 
resistance to nucleolytic degradation such as, for 
example, modified internucleoside linkages (Uhlmann and 
Peyman, Chemical Reviews 90:543-548, 1990; Schneider and 
20 Banner, Tetrahedron Lett 31:335, 1990 which are 

incorporated by reference), modified nucleic acid bases 
and/or sugars and the like. 

The therapeutic or pharmaceutical compositions of 
the present invention can be administered by any suitable 
25 route known in the art including for example intravenous, 
subcutaneous, intramuscular, transdermal, intrathecal or 
intracerebral. Administration can be either rapid as by 
injection or over a period of time as by slow infusion or 
administration of slow release formulation. For treating 
30 tissues in the central nervous system, administration can 
be by injection or infusion into the cerebrospinal fluid 
(CSF). When it is intended that persephin be 
administered to cells in the central nervous system, 
administration can be with one or more agents capable of 
35 promoting penetration of persephin across the blood-brain 
barrier. 
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Persephin can also be linked or conjugated with 
agents that provide desirable pharmaceutical or 
pharmacodynamic properties. For example, persephin can 
be coupled to any substance known in the art to promote 
5 penetration or transport across the blood-brain barrier 
such as an antibody to the transferrin receptor, and 
administered by intravenous injection. (See for example, 
Friden et al • , Science 259:373-377, 1993 which is 
incorporated by reference). Furthermore, persephin can 

10 be stably linked to a polymer such as polyethylene glycol 
to obtain desirable properties of solubility, stability, 
half-life and other pharmaceutical ly advantageous 
properties. (See for example Davis et al. Enzyme Eng 
4; 169-73, 1978; Burnham, Am J Hosp Pharm 52:210-218, 1994 

15 which are incorporated by reference). 

The compositions are usually employed in the form of 
pharmaceutical preparations. Such preparations are made 
in a manner well known in the pharmaceutical art. One 
preferred preparation utilizes a vehicle of physiological 

20 saline solution, but it is contemplated that other 
pharmaceutical ly acceptable carriers such as 
physiological concentrations of other non- toxic salts, 
five percent aqueous glucose solution, sterile water or 
the like may also be used. It may also be desirable that 

25 a suitable buffer be present in the composition. Such 

solutions can. If desired, be lyophilized and stored in a 
sterile ampoule ready for reconstitution by the addition 
of sterile water for ready injection. The primary 
solvent can be aqueous or alternatively non-aqueous. 

30 Persephin can also be incorporated into a solid or semi- 
solid biologically compatible matrix which can be 
implanted into tissues requiring treatment. 

The carrier can also contain other pharmaceutically- 
acceptable excipients for modifying or maintaining the 

35 pH, osmolarity, viscosity, clarity, color, sterility, 
stability, rate of dissolution, or odor of the 
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formulation. Similarly, the carrier may contain still 
other pharmaceutically-acceptable excipients for 
modifying or maintaining release or absorption or 
penetration across the blood-brain barrier. Such 
5 excipients are those substances usually and customarily 
employed to formulate dosages for parenteral 
administration in either unit dosage or multi-dose form 
or for direct infusion into the cerebrospinal fluid by 
continuous or periodic infusion. 
10 Dose administration can be repeated depending upon 

the pharmacokinetic parameters of the dosage formulation 
and the route of administration used. 

It is also contemplated that certain formulations 
containing persephin are to be administered orally- Such 
15 formulations are preferably encapsulated and formulated 
with suitable carriers in solid dosage forms. Some 
examples of suitable carriers, excipients, and diluents 
include lactose, dextrose, sucrose, sorbitol, mannitol, 
starches, gum acacia, calcium phosphate, alginates, 
20 calcium silicate, microcrystalline cellulose, 

polyvinylpyrrolidone, cellulose, gelatin, syrup, methyl 
cellulose, methyl- and propylhydroxybenzoates , talc, 
magnesium, stearate, water, mineral oil, and the like. 
The formulations can additionally include lubricating 
25 agents, wetting agents, emulsifying and suspending 

agents, preserving agents, sweetening agents or flavoring 
agents. The compositions may be formulated so as to 
provide rapid, sustained, or delayed release of the 
active ingredients after administration to the patient by 
30 employing procedures well known in the art. The 

formulations can also contain substances that diminish 
proteolytic degradation and promote absorption such as, 
for example, surface active agents. 

The specific dose is calculated according to the 
35 approximate body weight or body surface area of the 

patient or the volume of body space to be occupied. The 
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dose will also be calcula'ted dependeni: upon -the 
particular route of administration selected. Further 
refinement of the calculations necessary to determine the 
appropriate dosage for treatment is routinely made by 
5 those of ordinary skill in the art. Such calculations 
can be made without undue experimentation by one skilled 
in the art in light of the activity of persephin. The 
activity of neurturin in target cells data is disclosed 
herein and in copending application Serial Number 

10 08/519,777 and the concentration of persephin required 
for activity at the cellular level is believed to be 
similar to that of neurturin. The activity of persephin 
on mesencephalic cells is reported in Example 17 below. 
Persephin activity on a particular target cell type can 

15 be determined by routine experimentation. Exact dosages 
are determined in conjunction with standard dose-response 
studies. It will be understood that the amount of the 
composition actually administered will be determined by a 
practitioner, in the light of the relevant circumstances 

20 including the condition or conditions to be treated, the 
choice of composition to be administered, the age, 
weight, and response of the individual patient, the 
severity of the patient's symptoms, and the chosen route 
of administration. 

25 In one embodiment of this invention, persephin may 

be therapeutically administered by implanting into 
patients vectors or cells capable of producing a 
biologically-active form of persephin or a precursor of 
persephin, i.e. a molecule that can be readily converted 

30 to a biological -active form of persephin by the body. In 
one approach cells that secrete persephin may be 
encapsulated into semipermeable membranes for 
implantation into a patient. The cells can be cells that 
normally express persephin or a precursor of persephin or 
35 the cells can be transformed to express persephin or a 
precursor thereof. It is preferred that the cell be of 
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human origin and that the persephin be human persephin 
when the patient is human. However, the formulations and 
methods herein can be used for veterinary as well as 
human applications and the term "patient" as used herein 

5 is intended to include human and veterinary patients. 
Cells can be grown ex vlv^o for use in 
transplantation or engraf tment into patients ( Muench et 
al., Leuk & Lymph 16:1-11, 1994 which is incorporated by 
reference ) . In another embodiment of the present 
10 invention, persephin can be used to promote the ex vivo 
expansion of a cells for transplantation or engraf tment. 
Current methods have used bioreactor culture systems 
containing factors such as erythropoietin, colony 
stimulating factors, stem cell factor, and interleukins 

15 to expand hematopoietic progenitor cells for 

erythrocytes, monocytes, neutrophils, and lymphocytes 
(Verfaillie, Stem Cells 22:466-476, 1994 which is 
incorporated by reference). These stem cells can be 
isolated from the marrow of human donors, from human 

20 peripheral blood, or from umbilical cord blood cells. 

The expanded blood cells are used to treat patients who 
lack these cells as a result of specific disease 
conditions or as a result of high dose chemotherapy for 
treatment of malignancy (George, Stem Cells 12(Suppl 

25 i;.- 249-255, 1994 which is incorporated by reference). In 
the case of cell transplant after chemotherapy, 
autologous transplants can be performed by removing bone 
marrow cells before chemotherapy, expanding the cells ex 
vivo using methods that also function to purge malignant 

30 cells, and transplanting the expanded cells back into the 
patient following chemotherapy ( for review see Rummel and 
Van Zant, J Hematotherapy 3:213-218, 1994 which is 
incorporated by reference ) . Since persephin and the 
related growth factor, neurturin, may be expressed in the 

35 developing animal in particular tissues where 

proliferation and differentiation of progenitor cells 
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occur, it is believed "that persephin can function to 
regulate the proliferation of hematopoietic stem cells 
and the differentiation of mature hematopoietic cells. 
Thus, the addition of persephin to culture systems used 
5 for ex vivo expansion of cells could stimulate the rate 
at which certain populations of cells multiply or 
differentiate, and improve the effectiveness of these 
expansion systems in generating cells needed for 
transplant. 

10 It is also believed that persephin can be used for 

the ex vivo expansion of precursor cells in the nervous 
system. Transplant or engraftment of cells is currently 
being explored as a therapy for diseases in which certain 
populations of neurons are lost due to degeneration such 

15 as, for example, in parkinson's disease (Bjorklund, Curr 
Opln Neuroblol 2:683-689, 1992 which is incorporated by 
reference). Neuronal precursor cells can be obtained 
from animal or human donors or from human fetal tissue 
and then expanded in culture using persephin. These 

20 cells can then be engrafted into patients where they 

would function to replace some of the cells lost due to 
degeneration. Because neurotrophins have been shown to 
be capable of stimulating the survival and proliferation 
of neuronal precursors cells such as, for example, NT-3 

25 stimulation of sympathetic neuroblast cells (Birren et 
al.. Develop 129:597-610, 1993 which is incorporated by 
reference), persephin could also function in similar ways 
during the development of the nervous system and could be 
useful in the ex vivo expansion of neuronal cells. 

30 In a number of circumstances it would be desirable 

to determine the levels of persephin in a patient. The 
identification of persephin along with the present report 
that persephin is expressed by certain tissues provides 
the basis for the conclusion that the presence of 

35 persephin serves a normal physiologic function related to 
cell growth and survival. Indeed, other neurotrophic 
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factzors are known to play a role in the function of 
neuronal and non-neuronal tissues. (For review see 
Scully and Otten, Cell Biol Int 19:459-469, 1995; Otten 
and Gadient, Jnt J Devi Neurosclences 13:147-151, 1995 
5 which are incorporated by reference )• Endogenously 
produced persephin may also play a role in cert:ain 
disease conditions, particularly where there is cellular 
degeneration such as in neurodegenerative conditions or 
diseases. Other neurotrophic factors are known to change 

10 during disease conditions. For example, in multiple 
sclerosis, levels of NGF protein in the cerebrospinal 
fluid are increased during acute phases of the disease 
(Bracci-Laudiero et al., Neurosclence Lett 147:9-12, 1992 
which is incorporated by reference) and in systemic lupus 

15 erythematosus there is a correlation between inflammatory 
episodes and NGF levels in sera (Bracci-Laudiero et al . 
NeuiroReport 4:563-565, 1993 which is incorporated by 
reference ) . 

Given that persephin is expressed in certain 

20 t:issues, it is thus likely that the level of persephin 
may be altered in a variety of conditions and that 
quantification of persephin levels would provide 
clinically useful information. Furthermore, in the 
treatment of degenerative conditions, compositions 

25 containing persephin can be administered and it would 
likely be desirable to achieve certain target levels of 
persephin in sera, in cerebrospinal fluid or in any 
desired tissue compartment. It would, therefore, be 
advantageous to be able to monitor the levels of 

30 persephin in a patient. Accordingly, the present 
invention also provides methods for detecting the 
presence of persephin in a sample from a patient. 

The term "detection" as used herein in the context 
of detecting the presence of persephin in a patient is 

35 Intended to include the determining of the amount of 
persephin or the ability to express an amount of 
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persephin in a patient:, "bhe distinguishing of persephin 
from other growth factors, the estimation of prognosis in 
terms of probable outcome of a degenerative disease and 
prospect for recovery, the monitoring of the persephin 
5 levels over a period of time as a measure of status of 

the condition, and the monitoring of persephin levels for 
determining a preferred therapeutic regimen for the 
patient • 

To detect the presence of persephin in a patient, a 

10 sample is obtained from the patient- The sample can be a 
t:issue biopsy sample or a sample of blood, plasma, serum, 
CSF or the like. Persephin is expressed in kidney and 
brain tissues as shown in example 18 and it is believed 
that persephin is expressed in other tissues not tested 

15 as well. Samples for detecting persephin can be taken 
from any tissue expressing persephin. When assessing 
peripheral levels of persephin, it is preferred that the 
sample be a sample of blood, plasma or serum or 
alternatively from a tissue biopsy sample. When 

20 assessing the levels of persephin in the central nervous 
system a preferred sample is a sample obtained from 
cerebrospinal fluid « 

In some instances it is desirable to determine 
whether the persephin gene is intact in the patient or in 

25 a tissue or cell line within the patient. By an intact 
persephin gene it Is meant that there are no alterations 
In the gene such as point mutations, deletions. 
Insertions, chromosomal breakage, chromosomal 
rearrangements and the like wherein such alteration might 

30 alter production of persephin or alter its biological 
activity, stability or the like to lead to disease 
processes or susceptibility to cellular degenerative 
conditions. Conversely, by a non- intact persephin gene 
it is meant that such alterations are present. Thus, in 

35 one embodiment of the present invention a method is 

provided for detecting and characterizing any alterations 
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in t:he persephin gene. The method comprises providing an 
oligonucleotide that contains the persephin cDNA, genomic 
DNA or a fragment thereof or a derivative thereof . By a 
derivative of an oligonucleotide, it is meant that the 
5 derived oligonucleotide is substantially the same as the 
sequence from which it is derived in that the derived 
sequence has sufficient sequence complement arily to the 
sequence from which it is derived to hybridize to the 
persephin gene. The derived nucleotide sequence is not 
10 necessarily physically derived from the nucleotide 

sequence^ but may be generated in any manner including 
for example, chemical synthesis or DNA replication or 
reverse transcription or transcription. 

Typically, patient genomic DNA is isolated from a 
15 cell sample from the patient and digested with one or 

more restriction endonucleases such as, for example, TaqI 
and Alul . Using the Southern blot protocol, which is 
well known in the art, this assay determines whether a 
patient or a particular tissue in a patient has an intact 
20 persephin gene or a persephin gene abnormality. 

Hybridization to the persephin gene would involve 
denaturing the chromosomal DNA to obtain a single- 
stranded DNA; contacting the single- stranded DNA with a 
gene probe associated with the persephin gene sequence; 
25 and identifying the hybridized DNA-probe to detect 
chromosomal DNA containing at least a portion of the 
human persephin gene. 

The term "probe" as used herein refers to a 
structure comprised of a polynucleotide which forms a 
30 hybrid structure with a target sequence, due to 

complementarity of probe sequence with a sequence in the 
target region. The probes need not reflect the exact 
sequence of the target sequence, but must be sufficiently 
complementary to selectively hybridize with the strand 
35 being amplified. By selective hybridization or specific 
hybridization it is meant that a polynucleotide 
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preferentially hybridizes to a target polynucleotide. 
Oligomers suitable for use as probes may contain a 
minimum of about 8-12 contiguous nucleotides which are 
complementary to the targeted sequence and preferably a 
5 minimum of about 15 nucleotides although polynucleotide 
probes up to about 20 nucleotides and up to about 100 
nucleotides or even greater are within the scope of this 
invention. 

The persephin gene probes of the present invention 
10 can be DNA or RNA oligonucleotides and can be made by any 
method known in the art such as, for example, excision, 
transcription or chemical synthesis . Probes may be 
labelled with any detectable label known in the art such 
as, for example, radioactive or fluorescent labels or 
15 enzymatic marker. Labeling of the probe can be 

accomplished by any method known in the art such as by 
PGR, random priming, end labelling, nick translation or 
the like. One skilled in the art will also recognize 
that other methods not employing a labelled probe can be 
20 used to determine the hybridization* Examples of methods 
that can be used for detecting hybridization include 
Southern blotting, fluorescence in situ hybridization, 
and single- strand conformation polymorphism with PGR 
amplification. 

25 Hybridization is typically carried out at 25-45 **G, 

more preferably at 32-40 **C and more preferably at 37- 
38 *G. The time required for hybridization is from about 
0.25 to about 96 hours, more preferably from about one to 
about 72 hours, and most preferably from about 4 to about 

30 24 hours. 

Persephin gene abnormalities can also be detected by 
using the PGR method and primers that flank or lie within 
the persephin gene. The PGR method is well known in the 
art. Briefly, this method is performed using two 
35 oligonucleotide primers which are capable of hybridizing 
to the nucleic acid sequences flanking a target sequence 
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that lies within a persephin gene and amplifying the 
target sequence. The terms "oligonucleotide primer" as 
used herein refers to a short strand of DNA or RNA 
typically ranging in length from about 8 to about 30 
5 bases. The upstream and downstream primers are 

preferably a minimum of from about 15 nucleotides to 
about 20 nucleotides and up to about 30 nucleotides or 
even greater in length. The primers can hybridize to the 
flanking regions for replication of the nucleotide 

10 sequence. The polymerization is catalyzed by a DNA- 
polymerase in the presence of deoxynucleotide 
triphosphates or nucleotide analogs to produce double- 
stranded DNA molecules. The double strands are then 
separated by any denaturing method including physical, 

15 chemical or enzymatic. Commonly, the method of physical 
denaturation is used involving heating the nucleic acid, 
typically to temperatures from about 80 °C to 105 for 
times ranging from about 1 to about 10 minutes. The 
process is repeated for the desired number of cycles. 

20 The primers are selected to be substantially 

complementary to the strand of DNA being amplified. 
Therefore, the primers need not reflect the exact 
sequence of the template, but must be sufficiently 
complementary to selectively hybridize or specifically 

25 hybridize with the strand being amplified. By selective 
hybridization or specific hybridization it is meant that 
a polynucleotide preferentially hybridizes to a target 
polynucleotide . 

After PGR amplification, the DNA sequence comprising 

30 persephin or pre-pro persephin or a fragment thereof is 

then directly sequenced and analyzed by comparison of the 
sequence with the sequences disclosed herein to identify 
alterations which might change activity or expression 
levels or the like. 
35 In another embodiment a method for detecting 

persephin is provided based upon an analysis of tissue 
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expressing the persephin gene. Certain tissues such as 
those identified below in example 18 have been found to 
express the persephin gene. The method comprises 
hybridizing a polynucleotide probe to mRNA from a sample 
5 of tissues that normally express the persephin gene or 
from a cDNA produced from the mRNA of the sample. The 
sample is obtained from a patient suspected of having an 
abnormality in the persephin gene or from a particular 
patient tissue or cell type suspected of having an 
10 abnormality in the persephin gene. The reference 
persephin polynucleotide probe can comprise SEQ ID 
NO: 179, SEQ ID NO: 180, SEQ ID NO: 190, SEQ ID NO: 191, SEQ 
ID NOS: 203-206 or derivatives thereof or fragments 
thereof so long as such derivatives or fragments 
15 specifically hybridize to persephin mRNA or from a cDNA 
produced from a persephin mRNA. 

To detect the presence of mRNA encoding persephin 
protein, a sample is obtained from a patient. The sample 
can be from blood or from a tissue biopsy sample. The 
20 sample may be treated to extract the nucleic acids 

contained therein. The resulting nucleic acid from the 
sample is subjected to gel electrophoresis or other size 
separation techniques . 

The mRNA of the sample is contacted with a nucleic 
25 acid serving as a probe to form hybrid duplexes. The use 
of a labeled probes as discussed above allows detection 
of the resulting duplex. 

When using the cDNA encoding persephin protein or a 
derivative of the cDNA as a probe, high stringency 
30 conditions can be used in order to prevent false 
positives, that is the hybridization and apparent 
detection of persephin nucleotide sequences when in fact 
an intact and functioning persephin gene is not present. 
When using sequences derived from the persephin cDNA, 
35 less stringent conditions could be used, however, this 
would be a less preferred approach because of the 



BNSOOCID: <WO_9914235A1_L> 



wo 99/14235 PCT/US98/19163 



57 

likelihood of false positives. The stringency of 
hybridization is determined by a number of factors during 
hybridization and during the washing procedure, including 
temperature, ionic strength, length of time and 
5 concentration of formamide. These factors are outlined 
in, for example, Sambrook et al . (Sambrook, et al . , 1989, 
supra ) • 

In order to increase the sensitivity of the 
detection in a sample of mRNA encoding the persephin 

10 protein, the technique of reverse 

transcription/polymerization chain reaction (RT/PCR) can 
be used to amplify cDNA transcribed from mRNA encoding 
the persephin protein- The method of RT/PCR is well 
known in the art (see example 9 and figure 6 below). 

15 The RT/PCR method can be performed as follows. 

Total cellular RNA is isolated by, for example, the 
standard guanidium isothiocyanate method and the total 
RNA is reverse transcribed. The reverse transcription 
method involves synthesis of DNA on a template of RNA 

20 using a reverse transcriptase enzyme and a 3* end primer. 
Typically, the primer contains an oligo(dT) sequence. 
The cDNA thus produced is then amplified using the PCR 
method and persephin specific primers. (Belyavsky et al, 
Nucl Acid Res 17:2919-2932, 1989; Krug and Berger, 

25 MetJiods in Enzymology, Academic Press, N.Y., Vol.152, pp. 
316-325, 1987 which are incorporated by reference). 

The polymerase chain reaction method is performed as 
described above using two oligonucleotide primers that 
are substantially complementary to the two flanking 

30 regions of the DNA segment to be amplified. 

Following amplification, the PCR product is then 
electrophoresed and detected by ethidium bromide staining 
or by phosphoimaging . 

The present invention further provides for methods 

35 to detect the presence of the persephin protein in a 

sample obtained from a patient. Any method known in the 
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art for detecting proteins can be used. Such methods 
include, but are not limited to immunodiffusion, 
immunoelectrophoresis , immunochemical methods , binder- 
ligand assays, immunohistochemical techniques, 
5 agglutination and complement assays. (for example see 
Basic and Clinical immunology. Sites and Terr, eds., 
Appleton & Lange, Norwalk, Conn, pp 217-262, 1991 which 
is incorporated by reference). Preferred are binder- 
ligand immunoassay methods including reacting antibodies 
10 with an epitope or epitopes of the persephin protein or 
derivative thereof and competitively displacing the 
labeled persephin protein or derivative thereof. 

As used herein, a derivative of persephin protein is 
intended to include a polypeptide in which certain amino 
15 acids have been deleted or replaced with other amino 
acids or changed to modified or unusual amino acids 
wherein the persephin derivative is biologically 
equivalent to persephin and/or wherein the polypeptide 
derivative cross-reacts with antibodies raised against 
20 the persephin protein. By cross-reaction it is meant 
that an antibody reacts with an antigen other than the 
one that induced its formation. 

Numerous competitive and non-competitive protein 
binding immunoassays are well known in the art. 
25 Antibodies employed in such assays may be unlabeled, for 
example as used in agglutination tests, or labeled for 
use in a wide variety of assay methods. Labels that can 
be used include radionuclides, enzymes, fluorescers, 
chemiluminescers, enzyme substrates or co- factors, enzyme 
30 inhibitors, particles, dyes and the like for use in 
radioimmunoassay ( RIA ) , enzyme immunoassays , e.g., 
enzyme-linked immunosorbent assay (ELISA), fluorescent 
immunoassays and the like. 

Polyclonal or monoclonal antibodies to the persephin 
35 protein or to an epitope thereof can be made for use in 
immunoassays by any of a number of methods known in the 
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ar-t. By epitope reference is made to an antigenic 
determinant of a polypeptide. The term epitope can also 
include persephin-specif ic B cell epitopes or T helper 
cell epitopes. An epitope could comprise 3 amino acids 
5 in a spacial conformation which is unique to the epitope. 
Generally an epitope consists of at least 5 such amino 
acids. Methods of determining the spatial conformation 
of amino acids are known in the art, and include, for 
example, x-ray crystallography and 2 dimensional nuclear 

10 magnetic resonance. 

One approach for preparing antibodies to a protein 
is the selection and preparation of an amino acid 
sequence of all or part of the protein, chemically 
synthesizing the sequence and injecting it into an 

15 appropriate animal, usually a rabbit or a mouse (See 
Example 10 ) . 

Oligopeptides can be selected as candidates for the 
production of an antibody to the persephin protein based 
upon the oligopeptides lying in hydrophilic regions, 
20 which are thus likely to be exposed in the mature 
protein. 

Antibodies to persephin can also be raised against 
oligopeptides that include one or more of the conserved 
regions identified herein such that the antibody can 
25 cross-react with other family members. Such antibodies 
can be used to Identify and Isolate the other family 
members • 

Methods for preparation of the persephin protein or 
an epitope thereof include, but are not limited to 

30 chemical synthesis, recombinant DNA techniques or 

Isolation from biological samples. Chemical synthesis of 
a peptide can be performed, for example, by the classical 
Merrlfeld method of solid phase peptide synthesis 
(Merrlfeld, J Am Chem Soc 85r2149, 1963 which Is 

35 Incorporated by reference) or the FMOC strategy on a 

Rapid Automated Multiple Peptide Synthesis system (DuPont 
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Company, Wilmington, DE ) ( Caprino and Han, J Org Chem 
37:3404, 1972 which is incorporated by reference). 

Polyclonal antibodies can be prepared by immunizing 
rabbits or other animals by injecting antigen followed by 
5 subsequent boosts at appropriate intervals. The animals 
are bled and sera assayed against purified persephin 
protein usually by ELISA or by bioassay based upon the 
ability to block the action of persephin. When using 
avian species, e.g. chicken, turkey and the like, the 

10 antibody can be isolated from the yolk of the egg. 

Monoclonal antibodies can be prepared after the method of 
Milstein and Kohler by fusing splenocytes from immunized 
mice with continuously replicating tumor cells such as 
myeloma or lymphoma cells. (Milstein and Kohler Nature 

15 256:495-497, 1975; Gulf re and Milstein, Methods In 

Enzymology : Immunochemical Techniques 73:1-46, Langone 
and Banatis eds.. Academic Press, 1981 which are 
incorporated by reference). The hybridoma cells so 
formed are then cloned by limiting dilution methods and 

20 supernates assayed for antibody production by ELISA, RIA 
or bioassay. 

The unique ability of antibodies to recognize and 
specifically bind to target proteins provides an approach 
for treating an over expression of the protein. Thus, 
25 another aspect of the present invention provides for a 

method for preventing or treating diseases involving over 
expression of the persephin protein by treatment of a 
patient with specific antibodies to the persephin 
protein . 

30 Specific antibodies, either polyclonal or 

monoclonal, to the persephin protein can be produced by 
any suitable method known in the art as discussed above. 
For example, murine or human monoclonal antibodies can be 
produced by hybridoma technology or, alternatively, the 

35 persephin protein, or an immunologically active fragment 
thereof, or an ant i- idiotypic antibody, or fragment 
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thereof can be administered to an animal to elicit the 
production of antibodies capable of recognizing and 
binding to the persephin protein. Such antibodies can be 
from any class of antibodies including, but not limited 
5 to igG, igA, igM, igD, and igE or in the case of avian 
species, IgY and from any subclass of antibodies. 

Preferred embodiments of the invention are described 
in the following examples. Other embodiments within the 
scope of the claims herein will be apparent to one 

10 skilled in the art from consideration of the specifica- 
tion or practice of the invention as disclosed herein. 
It is intended that the specification, together with the 
examples, be considered exemplary only, with the scope 
and spirit of the invention being indicated by the claims 

15 which follow the examples. 

Example 1 

This example illustrates the isolation and 
purification of neurturin from CHO cell conditioned 
20 medium. 

Preparation of CHO cell conditioned medium : 

A derivative of DG44 Chinese hamster ovary cells, 
DG44CHO-pHSP-NGFl-B (CHO) cells, was used (Day et al, J 
B±ol Chem 265:15253-15260, 1990 which is incorporated by 

25 reference). The inventors herein have also obtained 
neurturin in partially purified form from other 
derivatives of DG44 Chinese hamster ovary cells. The CHO 
cells were maintained in 20 ml medium containing minimum 
essential medium (MEM) alpha (Gibco-BRL No. 12561, 

30 Gaithersburg, MD ) containing 10% fetal calf serum 

(Hyclone Laboratories, Logan, UT ) , 2 mM 1-glutamine, 100 
U/ml penicillin, 100 pg/ml streptomycin and 25nM 
methotrexate using 150 cm^ flasks (Corning Inc., Corning 
NY). For passage and expansion, medium from a confluent 

35 flask was aspirated; the cells were washed with 10 ml 

phosphate buffered saline (PBS) containing in g/1, 0.144 
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KH2PO4, 0,795 NajHPO^ and 9.00 NaCl; and the flask was then 
incubat:ed for 2-3 minutes with 2 ml 0,25% trypsin in PBS. 
Cells were "then knocked off the flask surface, 8 ml of 
medium were added and cells were triturated several times 
5 with a pipette. The cells were split 1:5 or 1:10, 

incubated at 37^*0 under an atmosphere of 5% CO2 in air and 
grown to confluence for 3-4 days. 

The cell culture was then expanded into 850 cm^ 
roller bottles (Becton Dickinson, Bedford, ^4A). A 

10 confluent 150 cm^ flask was trypsinized and seeded into 

one roller bottle containing 240 ml of the above modified 
MEM medium without me1:hot:rexate . The pH was maintained 
either by blanketing the medium with 5% CO2 in air or by 
preparing the medium with 25 mM HEPES pH 7.4 (Sigma, St- 

15 Louis, MO). The roller bottles were rotated at 0.8-1.0 
revolutions per minute. Cells reached confluence in 4 
days. 

For collecting conditioned medium, serum- free CHO 
cell (SF-CHO) medium was used. SF-CHO was prepared using 

20 1:1 DME/F12 base medium, which was prepared by mixing 1:1 
(v/v) DMEM (Gibco-BRL product No. 11965, Gibco-BRL, 
Gaithersburg, MD) with Ham's F12 (Gibco-BRL product No. 
11765). The final SF-CHO medium contained 15 mM HEPES pH 
7,4 (Sigma, St. Louis, MO), 0.5 mg/ml bovine serum 

25 albumin (BSA, Sigma, St. Louis MO), 25 pg/ml heparin, 

(Sigma, St. Louis, MO), IX insulin- transferrin-selenite 
supplement (bovine insulin, 5 jig/ml; human transferrin, 5 
yg/ml; sodium selenite, 5 ng/ml; Sigma, St. Louis, MO), 2 
mM l-glutamine, 100 U/ml penicillin, and 100 \xg/ml 

30 streptomycin. The medium from the confluent roller 

bottles was removed and the cells washed once with 30 ml 
SF-CHO medium to remove serum proteins. Cells were then 
incubated at 37 °C for 16-24 hrs in 80 ml SF-CHO medium to 
further remove serum proteins. The 80 ml medium was 

35 removed and discarded. A volume of 120 ml of SF-CHO 

medium was added to the flask and the cells incubated at 
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37**C. Every 48 hrs -thereafter, 120 ml was collected and 
replaced with the same volume of SF-CHO medium. 

Collected media was pooled and centrifuged at 4''C in 
polypropylene conical tubes to remove cellular debris and 
5 the supernatant stored at -70 "^C. Media was collected 5 
times over 10 days to yield a total of approximately 600 
ml conditioned medium per roller bottle. 

Fractions collected from the columns at each stage 
of purification were assayed for biological activity 
10 using the neuronal survival assay and for protein content 
by the dye binding assay of Bradford (Anal Blochem 72 z 248 
et seg., 1976 which is incorporated by reference). The 
total mg of protein in the starting volume, typically 50 
liters, of conditioned medium was determined. 
15 Superior Cervical Ganglion Survival Assay : 

The neurotrophic activity of CHO conditioned medium 
starting material and at various stages of purification 
was assessed using the superior cervical ganglion 
survival assay system previously reported (Martin, et al 
20 J of Cell Biology 106:829-844; Deckwerth and Johnson, J 
Cell Bio 123:1207-1222, 1993 which are incorporated by 
reference )• Primary cultures of sympathetic neurons from 
superior cervical ganglion (SCG) were prepared by 
dissecting tissue from Day 20-21 rat embryo (E20-E21). 
25 The SCG's were placed in Leibovitz's L15 with 1-glutamine 
medium (Cat #11415-023 Gibco-BRL, Gaithersburg, MD), 
digested for 30 minutes with 1 mg/ml collagenase (Ciat 
#4188 Worthington Biochemical, Freehold, N J ) in 
Leibovitz's L15 medium at 37^C, followed by a 30 minute 
30 digestion in trypsin- lyophili zed & irradiated (Type 

TRLVMF Cat #4454 Worthington Biochemical, Freehold, N J ) 
which was resuspended in modified Hanks' Balanced Salt 
Solution (Cat #H-8389 Sigma Chemical Co., St. Louis, MO). 
The digestion was stopped using AM50 which contains 
35 Minimum Essential Medium with Earle's salts and without 
1-glutamine (Cat #11090-016 Gibco-BRL), 10% fetal calf 
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serum (Cat #1115 Hyclone Laboratories, Logan, UT ) , 2mM 1- 
glutamine (Cat #G5763 Sigma Chemical Co., St. Louis, MO), 
20 yM FuDr (F-0503 Sigma Chemical Co., St. Louis, MO), 20 
pM Uridine (Cat #3003 Sigma Chemical Co-, St. Louis, MO), 
5 100 U/ml penicillin, 100 //g/ml Streptomycin, and 50 ng/ml 
2.5 S NGF. The cells were dissociated into a suspension 
of single cells using a silanized and flame-polished 
Pasteur pipet. After filtration of the suspension 
through a nitex filter (size 3-20/14, Tetko Inc., 

10 Elmsford, NY), the cells were placed in AM50 medium as 
above and preplated on a 100 mm Falcon or Primaria 
culture dish (Becton Dickinson Labware, Lincoln Park, N J ) 
to reduce the number of non-neuronal cells. After 2 
hours, the medium containing the unattached neuronal 

15 cells was removed from these dishes and triturated again 
through a silanized and flame-polished Pasteur pipet. 
The single cell suspension was plated on 24-well tissue 
culture plates (Costar, Wilmington, MA) that have been 
previously coated with a double layer of collagen, one 

20 layer of collagen that had been ammoniated and a second 
layer of collagen that had been air dried. They were 
allowed to attach for 30 minutes to 2 hours. A specific 
number of viable cells, usually about 1200 to about 3000 
total cells per well, or a specific percentage of the 

25 ganglion, usually 25% of the cells obtained per ganglion 
were plated into each well. When cell counts were to be 
performed they were placed in the 24-well dishes as 
stated above or alternatively, on 2-well chamber slides 
(Nunc, Naperville, XL). Cultures were then incubated for 

30 5-6 days at 37** in AM50 medium in a 5% C02/95% air 
atmosphere. The death of the cultured neurons was 
induced by exchanging the medium with medium without NGF 
and with 0.05% goat anti-NGF (final titer in the wells is 
1:10). This NGF-deprivation results in death of the 

35 neurons over a period of 24-72 hours. Aliquots of 

partially purified or purified factor, or appropriate 
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controls, were added to the cultures at the time of NGF 
removal to determine the ability to prevent the neuronal 
death . 

Evaluation of the ability of column fraction, gel 
5 eluates, or purified factor to prevent neuronal death was 
by visual inspection of cultures under phase contrast 
microscopy. Viable neurons remained phase bright with 
intact neurites, whereas dead neurons were shrunken, 
phase dark, had irregular membranes and neurites were 

10 fragmented ( Figure 3 ) . Where precise quantitation of 
neuronal survival was required, the cultures were fixed 
in 4% paraformaldehyde or 10% Formalin in PBS, and 
stained with crystal violet solution, (Huntoon Formula 
Harleco E.M. Diagnostics Systems, Gibbstown, N J ) . When 

15 using 24 well dishes, 1 ^ul crystal violet solution was 
added to each well containing 10% formalin and the cells 
were counted using a phase contrast microscope- If the 
2-well chamber slides were used, the cultures were fixed, 
stained with crystal violet, destained with water, 

20 dehydrated in increasing ethanol concentrations to 
toluene, and mounted in a toluene-based mounting 
solutxon. Neurons were scored as viable if they had a 
clear nucleolus and nuclei and were clearly stained with 
crystal violet. 

25 The neuronal death at 72 hours in shown in Figure 

3B. Also shown are (A) the positive control cells 
maintained with nerve growth factor and (C) the cells 
treated with anti-NGF and neurturin (approximately 3 
ng/ml ) showing survival of neurons . 

30 Activity was quantitated by calculation of a 

"survival unit". The total survival units in a sample 
were defined as the minimal volume of an aliquot of the 
sample which produced maximal survival divided into the 
total volume of that sample. Specific activity was 
35 calculated as the survival units divided by the mg total 
protein. 
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Survival units were determined in an assay using 
approximately 1200 viable neurons in a 0,5 ml culture 
assay and a culture period of 48 hours following addition 
of the fraction. Survival was assessed visually after 
5 the 48 hours. Intrinsic activity as shown in Figure 4 
was determined in an assay using approximately 2700 
neurons and a culture period of 72 hours. Survival was 
assessed by fixing the neurons and counting the number of 
surviving neurons. Because the stability, as assessed by 

10 half -life of activity, for neurturin decreases as the 
number of neurons increases, the intrinsic activity 
measurement would be expected to be lower than that 
predicted by Specific Activity determinations. The 
intrinsic activity measurement would also be expected to 

15 be lower than that predicted by specific activity because 
the survival was measured after 72 hours instead of 48 
hours . 

To ensure the reproducibility of these activity unit 
assays, it was necessary to plate the primary neuronal 

20 cultures at reproducible cell densities, as the stability 
of the activity decreases significantly with increasing 
neuronal density. The range of cell densities was from 
about 1200 to about 2700 cells per well. The presence of 
soluble heparin in the assay medium had no effect on the 

25 short-term (-3 days) stability of the survival activity. 
Purification of Neurturin ; 

Pooled conditioned medium was filtered through 0.2 
pi pore bottle-top filters (cellulose acetate membrane. 
Corning Inc., Corning, NY). Typically 50 liters of 

30 conditioned medium was used and processed in 25 liter 

batches. Each 25 liter batch was introduced at a rate of 
20 ml/min onto a 5 x 5 cm column containing 100 ml 
heparin-agarose (Sigma, St. Louis, MO) equilibrated with 
25 mM HEPES, pH 7 . 4 buffer with 150 mM NaCl . The column 

35 was then washed with approximately 1000 ml 25 mM HEPES, 
pH 7.4 buffer containing 0.5 M NaCl at 20 ml/min and the 
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activity was then eluted with 25 mM HEPES, pH 7.4 buffer 
containing 1,0 M NaCl . After switching to the l.OM NaCL 
elution buffer, the first 50 ml of buffer was discarded 
and, thereafter, one 300 ml fraction was collected, 
5 Pooled material eluted from the Heparin-agarose 

column was then diluted 1:1 (v/v) with 25 mM HEPES, pH 
7.4 buffer containing 0.04% TWEEN 20 to a NaCl 
concentration of 0.5 M and introduced into a 1 . 5 cm x 9 
cm column containing 16 ml SP SEPHAROSE® High Performance 

10 ion exchange resin (Pharmacia, Piscataway, N J ) 

equilibrated in 25 mM HEPES 7.4 containing 0.5 M NaCl and 
0.02% TWEEN 20. The column was then washed with 160 ml 
25 mM HEPES, pH 7.4 buffer containing 0.5 M NaCl and 
0.02% TWEEN 20 and the activity was eluted with 25 mM 

15 HEPES, pH 7.4 buffer containing 1,0 M NaCl and 0.02% 

TWEEN 20 at a flow rate of 2 ml/min. One 50 ml fraction 
was collected after the first 7 ml of eluate from the 
column. 

Material eluted from the SP SEPHAROSE® column was 

20 fractionated using fast protein liquid chromatography 
(FPLC) on a Chelating Superose HR 10/2 column charged 
with Cu** (Pharmacia, Piscataway, N J ) . The column had 
been prepared by washing with 10 ml water, charging with 
3 ml of 2.5 mg/ml CuSO^- 5H2O, washing with 10 ml water, 

25 and equilibrating with 10 ml of 25 mM HEPES pH 7.4 buffer 
containing 1.0 M NaCl and 0.02% TWEEN 20. The eluate was 
introduced into the column in 25 mM HEPES, pH 7 . 4 buffer 
containing 1.0 M NaCl at a rate of 1.0 ral/min. The bound 
proteins were eluted with a linear gradient of increasing 

30 glycine concentration (0-300 mM) in 25 mM HEPES, pH 7.4 
buffer containing 1.0 M NaCl at a rate of 1.0 ml/min. 
The gradient was produced by a Pharmacia FPLC system 
using an LCC-500 controller and P-500 pumps to establish 
a 0-300 mM glycine gradient in 40 ml at 1-0 ml/min, thus 

35 increasing the gradient by 7.5 mM glucine per min. One 
ml fractions were collected and assayed for SCG survival 
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promo-tion. Peak activity was observed in fractions 17- 
20, i.e. 17-20 min or ml from the start of the gradient. 

Absorbance measurements at 280 nM by an in-line UV 
monitor indicated that most proteins eluted prior to the 
5 survival activity in fractions 17-20. Thus, significant 
purification was achieved at this step. A 25 kD band co- 
purified with the survival activity • 

The combined eluted fractions from the Cu** superose 
column were diluted to 0.45 M NaCl using 25 mM HEPES pH 

10 7.4 buffer containing 0.02% TWEEN 20 and introduced into 
a Mono S HR 5/5 cation exchange column (Pharmacia, 
Piscataway, NJ) for further FPLC purification. The 
column had been equilibrated with 25 mM HEPES pH 7.4 
buffer containing 0.45 M NaCl containing 0.02% TWEEN 20. 

15 Bound proteins were eluted with a linear gradient of 

increasing NaCl concentration (0.45-1.0 M). The gradient 
was produced as described above from 0.45 M - 1.0 M NaCl 
in 35 mis at 1.0 ml/min, thus increasing concentration at 
0.0157 M per ml or min. Thirteen 1 . 0 ml fractions 

20 (fractions 1-13) were collected followed by 44 0.5 ml 

fractions (fractions 14-53). Peak activity in SCG assay 
was in fractions 26-29. Each fraction was assayed in the 
SCG suarvival assay over a range of volumes of from 0.1 to 
1.0 ]xl per 0.5 ml culture medium. 

25 One percent (5 pi ) of each fraction was loaded onto 

a non-reducing, 14% SDS polyacrylamide gel and 
electrophoresed for 750 V-hr at 25**C. Proteins were 
visualized by silver stain. The results are shown in 
Figure 2. Markers shown in lane M on the gel represent 

30 20 ng of Bovine serum albumin, carbonic anhydrase, B- 
lactoglobulin, and lysozyme in the order of descending 
molecular weight. 

A 25 kD band appeared in fractions 25-30, a 28 kD 
protein elutes earlier in the gradient and an 18 kD 

35 elutes later in the gradient. Figure 2 illustrates the 
survival activity in each of the fractions. The survival 
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act:ivit:y is noted to correspond with the presence and 
apparent intensity of the 25 kD protein in fractions 25- 
30. 

To demonstrate that the 25 kD band was responsible 
5 for survival promoting activity, the 25 kD protein was 
eluted from the polyacrylamide gel after electrophoresis 
and assayed for survival activity in the SCG assay. 
After electrophoresis of 150 pi of the SP SEPHAROSE® 1.0 
M NaCl fraction in one lane of a non-reducing 14% SDS- 
10 polyacrylamide gel as above, the lane was cut into 12 

slices and each slice was crushed and eluted by diffusion 
with rocking in buffer containing 25 mM HEPES, pH 7.4, 
0.5 M NaCl, 0.02% Tween-20 for 18 hr at 25 "C. BSA was 
added to the eluate to a final concentration of 200 pg/ml 
15 and the eluate was filtered through a 0.45 micron filter 
to remove acrylamide gel fragments. The filtrate was 
then added to a SP SEPHAROSE® column to concentrate and 
purify the sample. Before eluting the sample, the column 
was washed once in 400 pi 25 mM HEPES, pH 7.4 buffer 
20 containing 0.5 M NaCl, 0.02% Tween-20 and 200 pg BSA per 
ml and once in 400 pi 25 mM HEPES, pH 7.4 buffer 
containing 0.02% Tween-20 and 200 pg BSA per ml. The 
column was then washed again in 400 pi of 25 mM HEPES, pH 
7.4 buffer containing 0.5 M NaCl, 0.02% TWEEN 20 and 200 
25 ug BSA per ml. The sample was eluted with 25 mM HEPES, 
pH 7.4 buffer containing 1.0 M NaCl, 0.02% Tween-20 and 
200 pg BSA per ml. Samples were then analyzed for 
survival activity. Only the slice corresponding to the 
25 kD band showed evidence of survival activity. The 25 
30 kD protein purified from CHO cell conditioned media is 
believed to be a homodimer. 

The yield from the purification above was typically 
1-1.5 pg from 50 liters of CHO cell conditioned medium. 
Overall recovery is estimated to be 10-30%, resulting in 
35 a purification of approximately 390,000 fold. 
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The progressive purification using the above steps 
is shown in table 2. 

Table 2 

5 





Protein' 
(mg) 


Activity" 
(units) 


Specific 
Activity** 
(units/mg) 


Yield 
(%) 


Purification 
(fold) 


Conditioned 
Medium 


5000 


48000^= 


9.6 






Heparin 
Agarose 


45 


48000 


1068 


100 


111 


SP Sepliarose 


5.3 


48()()() 


9058 


100 


943 


Cu++ Superose 


0.31 


30000 


96700 


62 


10070 


Mono S 


0.004 


15000 


3750000 


31 


390000 



15 a. mg protein was dctennincd using the dye binding method of Bradford 

(Anal Biochem 72:248, 1976). 

b. The total activity units or survival units in a sample were defined as 
the minimal volume of an aliquot of the sample which produced 

20 maximal survival divided into the total volume of that sample. 

c. Activity for Conditioned Medium was derived from the assumption 
that 100% of the activity was recovered in the heparin agarose fraction 
because the activity of conditioned medium was loo low to be directly 

25 assayed. 

d. Specific Activity was the Activity units divided by the mg total 
protein. 

30 

Example 2 

This example illustrates the characterization of 
neurturin and several members of the TGF-B family of 
growth factors in the SCG assay and the lack of cross 
35 reactivity of anti-GDNF antibodies with neurturin. 

The SCG assay of the purified protein indicated 
that the factor is maximally active at a concentration of 
approximately 3 ng/ml or approximately 100 pM and the EC50 
was approximately 1.5 ng/ml or approximately 50 pM in the 
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expected range for a diffusible peptide growth factor 
{ Figure 4 ) . 

Several members of the TGF-6 family influence 
neuropeptide gene expression in sympathetic neurons, 
5 while others promote survival of different neuronal 
populations. Neurturin, which is a distant member of 
this family of proteins, is capable of promoting 
virtually complete survival of sympathetic neurons for 3 
days. In addition, further culturing of the SCG cells 
10 revealed that neurturin could continue to maintain these 
neurons for at least 10 days after withdrawal of NGF. 

We tested several other members of the TGF-6 
family for their ability to promote survival in the SCG 
assay including TGF-61, activin, BMP-2, BMP-4, BMP- 6 and 
15 GDNF. Of these factors, only GDNF had survival promoting 
activity, however, the activity of GDNF was much less 
potent than neurturin in this activity showing an EC^o of 
2-4 nM in the 3-day survival assay. The GDNF tested in 
this assay was rhGDNF produced in E. Coll obtained from 
20 Prepro Tech, Inc., Rocky Hill, N.J. The duration of 
action of GDNF was also less than that of neurturin 
inasmuch as the ability of GDNF ( 50 ng/ml ) to maintain 
survival longer than 3 days was substantially diminished. 
These experiments suggest the possibility that GDNF is a 
25 weak agonist for the neurturin receptor. Furthermore, 
the inability of activin and BMP-2 to promote survival, 
in contrast to their strong induction of transmitter- 
related gene expression in these neurons ( Fann and 
Paterson, Int J Dev Neuroscl 23:317-330, 1995; Fann and 
30 Patterson, J Neurochem 61:1349-1355, 1993) suggests that 
they signal through alternate receptors or signal 
transduction pathways. 

To determine the cross-reactivity of anti-GDNF 
antibodies with partially purified neurturin, SCG 
35 neurons, that had been dissected and plated as described 
in Example 1 were treated on Day 6 with 1 ng/ml, 3 ng/ml. 



BNSOOCiD: <WO 9914235A1J_> 



wo 99/14235 



PCT/US98/19163 



72 

10 ng/ml, or 30 ng/ml GDNF ( Prepro Tech, Inc, Rocky Hill, 
N. J. ) in the presence of anti-NGF alone, or in the 
presence of anti-NGF and anti-GDNF (goat igG antibody to 
E. coli-derived rhGDNF, R&D Systems, Minneapolis, 
5 Minn). A partially purified 1.0 M SP Sepharose fraction 
of neurturin was used in the assay at the approximate 
concentrations of 375 pg/ml, 750 pg/ml, 1.5 ng/ml and 3 
ng/ml. This fraction was tested in the presence of 
anti-NGF alone, and in the presence of anti-NGF and 
10 anti-GDNF. The anti-GDNF antibody blocked the survival 
promoting activity of GDNF at a concentration up to 30 
ng/ml, but did not block the survival promoting activity 
of neurturin . 

15 Example 3 

This example illustrates the effect of neurturin 
on sensory neurons in a nodose ganglion survival assay. 

CHO cell conditioned media that had been partially 
purified on the SP Sepharose column was assayed for 

20 neurotrophic activity on sensory neurons using nodose 
ganglia. The survival assay is a modification of that 
previously reported above for superior cervical ganglia. 
Primary dissociated cultures of nodose ganglia were 
prepared by dissecting tissue from E18 Sprague Dawley rat 

25 pups. The nodose ganglia were placed in Leibovitz's L15 
with 2 mM 1-glutamine (Cat# 11415-023, GIBCO-BRL. 
Galthersburg , MD) as the tissues was dissected, digested 
for 30 min with 1 mg/ml collagenase (Cat#4188, 
Worthington Biochemical, Freehold, New Jersey) in 

30 Leibovitz's L15 medium at 37**C, followed by 30 min 

digestion in trypsin ( lyophilized and irradiated, type 
TRLVMF, Cat #4454 Worthington Biochemical, Freehold, N J ) , 
and resuspension to a final concentration of 0.25% in 
modified Hank's Balanced Salt Solution (Cat#H8389, Sigma 

35 Chemical Co., St. Louis, Mo). The digestion was stopped 
using AMO-BDNFIOO, a medium containing Minimum Essential 
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Medium with Earle's salts and without: 1-glutamine 
(#11090-016 GIBCO-BRL), 10% fetal Calf Serum (Cat#1115, 
Hyclone Laboratories, Logan, UT ) , 2 mM 1-glutamine 
(Cat#G5763 Sigma Chemical Co-, St. Louis, Mo.), 20 ^M 
5 FuDr (F-0503, Sigma Chemical Co.), 20 pM Uridine (Cat 
#3003, Sigma Chemical Co., St. Louis, Mo.) 100 U/ml 
penicillin, 100 pg/ml Streptomycin, and 100 ng Brain 
Derived Neurotropic Factor (BDNF, Amgen, Thousand Oaks, 
OA). The cells were dissociated into a suspension of 
10 single cells using a silanized and flame -poll shed Pasteur 
pipet in the AMO-BDNFIOO medium, and preplated on a 100 
mm Falcon or Primaria culture dish (Becton Dickinson 
Labware, Lincoln Park, NJ) to remove non-neuronal cells* 
After 2 hours, the medium containing the unattached 
15 neuronal cells was removed from these dishes and 

triturated again through a silanized and flame-polished 
Pasteur pipet. The single cell suspension was plated on 
24-well tissue culture plates (Costar, Wilmington, MA) 
that have been previously coated with a double layer of 
20 collagen, one layer of which had been ammoniated and a 
second layer that had been air dried. Ganglia from ten 
E18 rat embryos were dissociated into 2.5 mis of media 
and 100 pi of this suspension was added to each well- 
The cells were allowed to attach for 30 min in a 37**C 
25 incubator with 5% C02/95% air. The wells were fed with 
AMO-BDNFIOO media overnight. 

The next day the cells were washed 3 times for 20 
min each time with AMO medium containing no BDNF. The 
wells were fed with 0.5 ml of this media alone or this 
30 media containing either 50 ng/ml NGF, 100 ng/ml BDNF 

(Amgen, Thousand Oaks, CA), 100 ng/ml GDNF ( Prepro Tech, 
Inc., Rocky Hill, N.J) or 3 ng/ml Neurturin. The cells 
were incubated at 37** C in a 5% C02/95% air incubator for 3 
days, fixed with 10% formalin, stained with crystal 
35 violet (1 vil/ml 10% formalin) and counted. Survival was 
ascertained as noted previously. 
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The neuronal Death at 72 hours is shown in Figure 
10. Neuronal survival of nodose neurons cultured in 
BDNF has been previously reported (Thaler et al. Develop 
Biol 161:338-344^ 1994 which is incorporated by 
5 reference). This was used as the standard for survival 
for these neurons and given the value of 100% survival . 
Nodose ganglia that had no trophic support (AMD) showed 
20%-30% survival, as did neurons that were cultured in 
•the presence of 50 ng/ml NGF. Neurons cultured in the 

10 presence of 3 ng/ml neurturin and absence of BDNF showed 
survival similar to those neurons cultured in the 
presence of BDNF ( 100 ng/ml ) . GDNF at a concentration of 
100 ng/ml promoted greater survival of nodose neurons 
than did BDNF ( 100 ng/ml ) . Similar findings with GDNF 

15 were recently reported for sensory neurons from chicken 
(Ebendal, T. et al, J Neuroscl Res 40:276-284 1995 which 
is incorporated by reference ) ♦ 

Example 4 

20 This example illustrates the determination of 

partial amino acid sequences of neurturin isolated from 
CHO cell conditioned medium. 

To obtain N-terminal amino acid sequence from a 
purified preparation of approximately 1 pg of neurturin, 

25 the Mono S fractions 26-29 containing the peak of 
activity were concentrated to 25 pi by centrifuge 
ultrafiltration in a microcon-3 concentrators (Amicon, 
Inc., Beverley, MA) and loaded onto a non-reducing 14% 
SDS polyacrylamide gel. After electrophoretic 

30 separation, proteins were electroblotted to a PVDF 

membrane (Bio-Rad, Hercules, CA) and stained with 0.1% 
Coomassie Blue. The 25 kD band was excised and inserted 
into the reaction cartridge of an automated sequencer 
(Model 476, Applied Biosystems (Foster City, CA). 

35 Phenylthiohydantoin-amino acid (PTH-aa) recovery in the 
first 2-3 cycles of automated sequencing by Edman 
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degradation indicated a sequencing yield of 4 pmoles, 
which was approximately 10% of the estimated amount of 
protein loaded on the SDS gel . 

Two N-terminal sequencing runs were performed from 
5 two 50 liter purification preparations ♦ In the first 
run, 1 pg of protein in 3 pooled fractions of 1 . 5 ml 
total volume were concentrated to 25 \xl and 
electroblotted at lOOV for 2 hrs at 25 *C using an 
electroblot buffer of 10 mM CAPS pH 11.0 buffer (Sigma, 
10 St. Louis, MO) containing 5% methanol. The amino acid 

sequence was obtained from 13 cycles of Edman degradation 
and the sequencing yield was 4 pmoles as above. 

In the second run, 1.5 ]ig of protein in 4 pooled 
fractions of 2.0 ml total volume were concentrated to 25 
15 }il and electroblotted at 36V for 12 hours at 4°C using an 
electroblot buffer of 25 mM Tris, 192 mM glycine, 0.04% 
SDS and 17% MeOH. Sequencing yield was 15 pmoles and the 
sequence after 16 cycles was SGARPXGLRELEVSVS ( SEQ ID 
N0:3). The sequence obtained after 16 cycles 
20 corresponded to the shorter sequence obtained in the 

first run. Definite assignments could not be made at 3 
of the amino acid residues in the sequence (residues 1, 6 
and 11 from the N-terminal). A search of protein 
databases did not detect any significantly homologous 
25 sequences, suggesting that the purified factor was a 
novel protein . 

This initial N-terminal amino acid sequence data 
did not enable the isolation of cDNA clones using 
degenerate oligonucleotides as PGR primers or probes for 
30 screening libraries. To facilitate these approaches, 
additional protein was purified in order to obtain 
internal amino acid sequence from proteolytic fragments. 
To obtain internal amino acid sequence from neurturin, an 
additional 50 liters of CHO cell conditioned medium was 
35 purified using only the first 3 chromatographic steps as 
outlined above, except that the gradient used to elute 
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t:he Cu++ Chelating Superose column was as follows: 0-60 
mM glycine (4 ml), 60mM glycine (10ml), 60-300 mM glycine 
(32 ml). Fractions No. 20-23 containing neurturin were 
concentrated to 25 \il by ultrafiltration (Amicon microcon 
5 3, Amicon, Beverley, MA) and loaded on a non-reducing SDS 
polyacrylamide gel. After electrophoresis, the gel was 
stained with Coomassie blue and the 25 kD neurturin band 
was excised. Neurturin was digested in the gel slice 
with endoproteinase Lys-C, and the eluted proteolytic 

10 fragments were purified by reverse phase HPLC. Only one 
peak was observed upon HPLC separation of the eluted 
peptides, which yielded amino acid sequence information 
for 23 cycles at the 1 pmole signal level using the 
automated sequencer, (internal fragment P2, SEQ ID N0:5). 

15 Amino acid analysis performed on 10% of the above 

sample before subjecting it to digestion had indicated 
that 150 pmoles of protein were present in the gel slice, 
consisting of 7.6% lysine and 19.5% arginine. The single 
low level peak from the Lys-C digestion suggested that 

20 the digestion and elution of peptides were inefficient. 
The same gel slice was redigested with trypsin and the 
eluted peptides separated by HPLC. Two peaks were 
observed on HPLC, resulting in the elucidation of two 
additional 10 residue amino acid sequences (4-5 pmole 

25 signal level, internal fragment PI, SEQ ID NO: 4 and 
internal fragment P3, SEQ ID NO: 6) that were distinct 
from the N-terminal and previous internal amino acid 
sequences. The in situ digestion, elution and 
purification of peptides, and peptide sequencing was 

30 performed by the W.M, Keck Foundation Biotechnology 
Resource Laboratory at Yale University according to 
standard protocols for this service. 



35 
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Example 5 

The following example illustrates the isolation 
and sequence analysis of mouse and human neurturin cDNA 
clones . 

5 Degenerate oligonucleotides corresponding to 

various stretches of confident amino acid sequence data 
were synthesized and used as primers in the polymerase 
chain reaction (PGR) to amplify cDNA sequences from 
reverse transcribed mRNA. A forward primer (M1676; 

10 5 ' -CCNACNGCNTAYGARGA, SEQ ID NO: 50) corresponding to 

peptide sequence P2 Xaai-Xaa2-Val-Glu-Ala-Lys-Pro-Cys-Cys- 
Gly-Pro-Thr-Ala-Tyr-Glu-Asp-Xaaj-Val-Ser-Phe-Leu-Ser-Val 
where Xaa^ and Xaa2 were unknown, Xaag was Gin or Glu (SEQ 
ID NO: 5) in combination with a reverse primer (M1677; 

15 5 • -ARYTCYTGNARNGTRTGRTA (SEQ ID NO: 52) corresponding to 
peptide sequence P3 

(Tyr-His-Thr-Leu-Gln-Glu-Leu-Ser-Ala-Arg) (SEQ ID NO: 6) 
were used to amplify a 69 nucleotide product from cDNA 
templates derived from E21 rat and adult mouse brain. 

20 The PGR parameters were: 94**C for 30 sec; 55 °C for 30 
sec; 72**C for 1 min for 35 cycles. The product was 
subcloned into the Bluescript KS plasmid and sequenced. 
All nucleotide sequencing was performed using fluorescent 
dye terminator technology per manufacturer's instructions 

25 on an Applied Biosystems automated sequencer Model #373 
(Applied Biosystems, Foster City, OA). Plasmid DNA for 
sequencing was prepared using the Wizard Miniprep kit 
(Promega Corp., Madison, WI ) according to the 
manufacturer's instructions. The sequence of the 

30 amplified product correctly predicted amino acid sequence 
data internal to the PGR primers. 

Primers corresponding to the amplified sequence 
were used in combination with the degenerate primers in 
the rapid amplification of cDNA ends (RACE) technique 

35 (Frohman, M.A. Methods In Enzymology 218:340-356, 1993) 
using the Marathon RACE kit (CLONTECH, Palo Alto, CA) per 
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the manufacturer's instructions, except that first strand 
cDNA synthesis was carried out at 50 °C using Superscript 
li reverse transcriptase (Gibco-BRL)- Briefly, a double 
stranded adaptor oligonucleotide was ligated to the ends 
5 of double stranded cDNA synthesized from postnatal day 1 
rat brain mRNA. Using nested forward neurturin PGR 
primers (M1676; 5 ' -CCNACNGCNTAYGARGA, SEQ ID NO: 50 and 
1678; 5 ' -GACGAGGGTCCTTCCTGGACGTACACA, SEQ ID NO: 53) in 
combination with primers to the ligated adaptor supplied 
10 in the kit (API, AP2 ) , the 3' end of the neurturin cDNA 
was amplified by two successive PGR reactions (1st: M1676 
and API, using 94**G for 30 sec, 55 °C for 30 sec and 72 *G 
for 2 min for 35 cycles; 2nd: M1678 and AP2 using 94**G 
for 30 sec and 68*'C for 2 min for 35 cycles). A 5' 
15 portion of the rat neurturin cDNA was obtained by two 
successive PGR reactions using the linkered cDNA as 
template. The 1st reaction utilized primers M1677 (SEQ 
ID NO: 52) and API; using 94**C for 30 sec; 55^0 for 30 
sec; and 72**C for 2 min for 35 cycles. The 2nd reaction 
20 used M1679 5 ' -TAGCGGGTGTGTACGTGGAGGAAGGACACCTGGT (SEQ ID 
NO: 54) and AP2 at 94 for 30 sec and 68 °C for 2 min for 
35 cycles. These reactions resulted in a truncated form 
of the 5* end of the neurturin cDNA, apparently the 
result of premature termination of the cDNA during 
25 reverse transcription. The 5' and 3' RAGE products were 
subcloned into the plasmid Bluescript KS and sequenced. 
The sequence of these 3' and 5' RAGE products resulted in 
a partial rat neurturin cDNA sequence of 220 nt. Primers 
(#467921 5'-GAGGGAGGACGGGTGGGGAAAGAGGG, SEQ ID NO: 55; and 
30 M1679 (SEQ ID NO: 54) corresponding to the partial rat 
cDNA sequence were used (PGR parameters 94 ^'C for 30 sec 
and 68**G for 1 min for 35 cycles) to amplify a 101 
nucleotide PGR product from mouse genomic DNA which was 
homologous to rat neurturin cDNA sequence. 
35 These primers were then used to obtain murine 

neurturin genomic clones by amplifying gene fragments in 
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a mouse 129 /Sv library in a PI bacteriophage vector 
(library screening service of Genome Systems, Inc., St. 
Louis, MO). A 1.6 kb Nco I fragment from this PI clone 
containing the neurturin gene was identified by 
5 hybridization with primer (#465782; 

5 ' -TAYGARGACGAGGTGTCCTTCCTGGACGTACACAGCCGCTAYCAYAC , SEQ 
ID NO: 56). This Nco I fragment was sequenced and found 
to contain a stretch of coding sequence corresponding to 
the N-terminal and internal amino acid sequences obtained 

10 from sequencing the active protein isolated from CHO cell 
conditioned media. Beginning at the N-terminal amino 
acid sequence of the purified protein, this nucleotide 
sequence encodes a 100 amino acid protein with a 
predicted molecular mass of 11.5 kD. A search of protein 

15 and nucleic acid databases identified neurturin as a 
novel protein that is approximately 40% identical to 
glial derived neurotrophic factor ( GDNF ) . GDNF was 
purified and cloned as a factor which promotes the 
survival of midbrain dopaminergic neurons and is a 

20 distantly related member of the TGF-B super family, which 
now includes more than 25 different genes that possess a 
wide variety of proliferative and dif f erentative 
activities. Although GDNF is less than 20% identical to 
any other member of the TGF-6 family, it contains the 7 

25 cysteine residues which are conserved across the entire 
family and believed to be the basis of a conserved 
cysteine knot structure observed in the crystal structure 
determination of TGF-B2. Neurturin also contains these 7 
cysteine residues, but like GDNF is less than 20% 

30 homologous to any otber member of the TGF-B family. 

Thus, neurturin and GDNF appear to represent a subfamily 
of growth factors which have significantly diverged from 
the rest of the TGF-6 superfamily. 

To determine the sequence of the full length mouse 
35 neurturin cDNA, 5' and 3' RACE PGR was performed as 

above for the rat, using nested primers predicted from 
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the mouse genomic sequence and cDNA from neona1:al mouse 
brain. The Ist reaction for the 3' end used primers: 
M1777 5 * -GCGGCCATCCGCATCTACGACCGGG ( SEQ ID NO: 57) and API 
at 94*^0 for 30 sec; SS^'C for 15 sec; and 68*^C for 2 min 
5 for 35 cycles. The 2nd reaction used primer #467921 (SEQ 
ID NO: 55) and AP2 at 94^C for 30 sec; 65**C for 15 sec; 
and 68**C for 2 min for 20 cycles. The 5' end was 
obtained using for the 1st reaction primer M1759, 
5 • -CRTAGGCCGTCGGGCGRCARCACGGGT (SEQ ID NO: 58) and API at 

10 94**C for 30 sec; 65**C for 15 sec; and 68^*0 for 2 min for 
35 cycles- The 2nd reaction used primer M1785, 
5 • -GCGCCGAAGGCCCAGGTCGTAGATGCG (SEQ ID NO: 59) and AP2 at 
94*^0 for 30 sec; 65 °C for 15 sec; and 68**C for 2 min for 
20 cycles. Both sets of PGR reactions included 5% DMSO. 

15 The 5' and 3' mouse RACE products were subcloned into the 
plasmid Bluescript KS and sequenced. Using the sequence 
of RACE products, a 1.0 kb mouse neurturin cDNA sequence 
can be assembled. This cDNA sequence contains an open 
reading frame of 585 nucleotides that encodes a protein 

20 with a molecular mass of 24 kD. This full length mouse 
cDNA sequence is shown in Figure 7 (SEQ ID NO: 12). 
Consistent with the processing events known to occur for 
TGF-B family members, the 24 kD neurturin protein 
contains an amino terminal 19 amino acid signal sequence 

25 followed by a pro-domain which contains an RXXR 

proteolytic processing site immediately before the 
N-teirminal amino acid sequence obtained when sequencing 
the protein purified from CHO cell conditioned media. 
Using these landmarks, the 11.5 kD mature neurturin 

30 molecule is predicted to be 11.5 kD and, by analogy to 

other members of the TGF-S family, is predicted to form a 
disulfide linked homodimer of 23 kD, consistent with the 
25 kD mass of the protein purified from CHO cell 
conditioned media as estimated by SDS-PAGE analysis . 
35 For isolation of human genomic clones, primers 

( #467524 ; 5 ' -CGCTACTGCGCAGGCGCGTGCGARGCGGC, SEQ ID NO : 60 
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and #10005, 5 ' -CGCCGACAGCTCTTGCAGCGTRTGGTA, SEQ ID NO: 61) 
predicted from the sequence of mouse neurturin were used 
to" amplify (PGR parameters: Initial denaturation at 95 °C 
for 1 min 30 sec followed by 94 '^C for 30 sec; eO^'C for 15 
5 sec; and 68^*0 for 60 sec for 35 cycles) a 192 nucleotide 
fragment from human genomic DNA. The sequence of the PGR 
product demonstrated that it was the human homolog of 
mouse neurturin. The primers were then used to screen a 
human genomic library constructed in the PI vector 
10 (library screening service. Genome Systems, Inc.) and two 
clones containing the human neurturin genomic locus were 
obtained . 

The same strategy was used to determine the human 
sequence as discussed above for the mouse sequence. An 

15 oligo (#30152, GACCTGGGCCTGGGCTACGCGTCCGACGAG , SEQ ID 

NO: 62) was used as a probe in a Southern blot analysis to 
identify restriction fragments of the PI Clones which 
contained the human neurturin coding sequence. These 
restriction fragments ( Eag I, Pvu II, Hind III, Kpn I) 

20 were subcloned into the Bluescript KS plasmid and 
sequenced . 

The results of subcloning and sequencing of human 
genomic fragments were as follows. The Eag I fragment 
was found to be approximately 6 kb in size with the 3 ' 

25 Eag I site located 60 bp downstream from the stop codon. 
The Pvu II fragment was approximately 3.5 kb in size with 
the 3' Pvu II site located 250 bp downstream from the 
stop codon. The Hind III fragment was approximately 4.8 
kb in size with the 3' Hind III site located 3kb 

30 downstream from the stop codon. The Kpn I fragment was 
approximately 4.2 kb in size with the 3' Kpn I site 
located 3.1 kb downstream from the stop codon. 

The second coding exon was sequenced using these 
subcloned fragments. In addition, sequence was obtained 

35 from 250 bp flanking the 3' side of the second exon. The 
sequence was also obtained from 1000 bp flanking the 5 ' 
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side of the coding exon. From these flanking sequences, 
forward primer 30341 ( 5 ' -CTGGCGTCCCAMCAAGGGTCTTCG-3 ' , SEQ 
ID NO: 71) and reverse primer 30331 (5'- 

GCCAGTGGTGCCGTCGAGGCGGG-3 ' , SEQ ID NO: 72) were designed 
5 so that the entire coding sequence of the second exon 
could be amplified by PGR. 

The first coding exon was not mapped relative to 
the restriction sites above but was contained in the Eag 
I fragment. The sequence of this exon was obtained from 

10 the subcloned Eag I fragment using the mouse primer 

466215 (5'-GGCCCAGGATGAGGCGCTGGAAGG-3' , SEQ ID NO: 73), 
which contains the ATG initiation codon. Further 
sequence of the first coding exon was obtained with 
reverse primer 20215 ( 5 ' -CCACTCCACTGCCTGAWATTCWACCCC-3 ' , 

15 SEQ ID NO: 74), designed from the sequence obtained with 
primer 466215. Forward primer 20205 (5*- 

CCATGTGATTATCGACCATTCGGC-3 ' , SEQ ID NO: 75) was designed 
from sequence obtained with primer 20215. Primers 20205 
and 20215 flank the coding sequence of the first coding 
20 exon and can be used to amplify this coding sequence 
using PGR . 

The human cDNA and inferred amino acid sequence is 
shown in Figure 7 and the mouse cDNA and inferred amino 
acid sequence is shown in Figure 8. 

25 

Example 6 

This example illustrates the preparation of 
expression vectors containing neurturin cDNA. 

30 For expression of recombinant neurturin in 

mammalian cells the neurturin vector pCMV-NTN-3-1 was 
constructed. The 585 nucleotide open reading frame of 
the neurturin cDNA was amplified by PGR using a primer 
containing the first 27 nucleotides of the neurturin 

35 coding sequence 

( 5 • -GCGACGCGTACCATGAGGCGCTGGAAGGCAGCGGCCCTG, SEQ ID 
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NO: 63) and a primer containing the last 5 codons and the 
stop COdon ( 5 * -GACGGATCCGCATCACACGCACGCGCACTC ) ( SEQ ID 
NO: 64) using reverse transcribed postnatal day 1 mouse 
brain mRNA as template using (PGR parameters: 94 °C for 30 
5 sec; 60^*0 for 15 sec; and 68**C for 2 min for 35 cycles 
and including 5% DMSO in the reaction). The PGR product 
was subcloned into the Eco RV site of BSKS and sequenced 
to verify that it contained no PGR generated mutations. 
The neurturin coding sequence was then excised from this 

10 vector using Mlu I (5' end) and Bam HI (3' end) and 

inserted downstream of the CMV IE promoter/enhancer in 
the mammalian expression vector pCB6 (Brewer, C.B. 
Methods In Cell Biology 43:233-245, 1994) to produce the 
pCMV-NTN-3-1 vector using these sites - 

15 For expression of recombinant protein in E. Coli, 

the mature coding region of mouse neurturin was amplified 
by PGR using a primer containing the first 7 codons of 
the mature coding sequence 

( 5' -GACGATATGCCGGGGGCTCGGCCTTGTGG) (SEQ ID NO: 65) and a 
20 primer containing the last 5 codons and the stop codon 

5 ' -GACGGATCCGGATCACACGCACGCGCACTC (SEQ ID NO: 66) using a 
fragment containing the murine neurturin gene as template 
using (PGR parameters: 94'*G for 30 sec; 60**G for 15 sec 
and 68**C for 90 sec for 25 cycles with 5%= DMSO added 
25 into the reaction). The amplified product was subcloned 
into the Eco RV site of BSKS, the nucleotide sequence was 
verified, and this fragment was then transferred to the 
expression vector pET-30a (Novagen, Madison, WI ) using an 
Nde 1 site (5' end) and an Eco Rl site (3* end). The 
30 pET-neurturin (pET-NTN) vector codes for an initiator 

methionine in front of the first amino acid of the mature 
mouse neurturin protein predicted from the N-terminal 
amino acid sequence of neurturin purified from the GHO 
cell conditioned media. 
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Example 7 

This example illustrates the transient 
transfection of NIH3T3 cells with the neurturin 
expression vector pCMV-NTN-3-1 and that the product of 
5 the genomic sequence in Example 5 is biologically active. 

To demonstrate that the cloned neurturin cDNA was 
sufficient to direct the synthesis of biologically active 
neurturin we transiently introduced the pCMV-NTN-3-1 
plasmid into NIH3T3 cells using the lipofectamine method 
10 of transfection. NIH3T3 cells were plated at a density 
of 400,000 cells per well (34.6 mm diameter) in 6 well 
plates (Corning, Corning, NY) 24 hours before 
transfection. DNA liposome complexes were prepared and 
added to the cells according to the manufacturer's 
15 protocol using 1.5 \xg CMV-neurturin plasmid DNA (isolated 
and purified using a Qiagen (Chatsworth, CA) tip- 500 
column according to manufacturer's protocol) and 10 pi 
lipofectamine reagent (Gibco BRL, Gaithersburg, MD) in 
1:1 DME/F12 medium containing 5 pg/ml insulin, 5 \xg/ml 
20 transferrin, and 5 ng/ml sodium selenite (Sigma, St. 
Louis, MO). Five hours after the addition of DNA 
liposome complexes in 1 ml medium per well, 1 ml DME 
medium containing 20% calf serum was added to each well. 
Twenty- four hours after the addition of DNA- liposome 
25 complexes, the 2 ml medium above was replaced with 1 ml 
DME medium containing 10% calf serum, 2 mM glutamine, 100 
U/ml penicillin, 100 p/ml streptomycin, and 25 ug/ml 
heparin. The cells were incubated for an additional 24 
hours before the conditioned medium was harvested, 
30 centrifuged to remove cellular debris, and frozen. 

As a control, NIH3T3 cells were transfected as 
above using 1 . 5 yg CMV-neo expression plasmid ( containing 
no cDNA insert ) in place of the 1.5 pg CMV-neurturin 
plasmid. Conditioned medium from NIH3T3 cells 
35 transfected with either control plasmid or CMV-neurturin 
plasmid was assayed by direct addition to the SCO culture 
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medium at "the time of NGF deprivation. Addition of 0.25 
ml conditioned medium from CMV-neurturin- transf ected 
cells promoted 70% survival of sympathetic neurons, and 
>90% survival could be obtained with 0.45 ml of this 
5 conditioned medium. No significant survival promoting 
activity was detected in the conditioned medium of 
control transf ected NIH3T3 cells. 

Example 8 

10 This example illustrates the preparation of 

Chinese hamster ovary cells stably transformed with 
neurturin cDNA. 

DG44 cells, a Chinese hamster ovary cell 
derivative that is deficient in dihydrof olate reductase 
15 (DHFR) (Urlaub et al Cell 3:405-412, 1983 which is 

incorporated by reference), were stably co-transf ected 
with expression plasmid ( pCMV-NTN-3-1 ) and a DHFR 
expression plasmid (HLD) (McArthur, and S tanners J, Biol. 
Chem. 266:6000-6005, 1991 which is incorporated by 
20 reference ) . 

On day 1 DG44 cells were plated at 1x10^ cells per 
10 cm plate in Ham's F12 medium with 10% fetal calf serum 
(FCS). This density must not be exceeded or cells will 
overgrow before selection media is added on day 5* 
25 On day 2 cells were transf ected with a 9:1 ratio 

of pCMV-NTN to DHFR expression plasmid using the calcium 
phosphate method (10 ug DNA /lO cm plate) (Chen and 
Okayama, Mol Cell Biol 7:2745-2752, 1987 which is 
incorporated by reference). 
30 On day 3 the transfected cells were washed with 

Ham's F12 medium and fed Ham's F12 with 10% FCS. 

On day 5 the cells were washed with MEM alpha 
medium and fed selection medium, which is MEM alpha with 
10% FCS and 400 ug/ml G418. The cells were maintained in 
35 selection media, feeding every 4 days. Colonies began to 
appear approximately 14 days after transf ection. 
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Colonies growing in selection media were then transferred 
-to a 24 well plate and trypsinized the next day to 
disperse the cells. The cells were grown to confluence 
in either 24 well or 6 well plates in order to screen the 
5 cells for expression of recombinant protein. Expression 
of neurturin was examined in 10 clonal lines and two high 
expressing lines were detected using the SCG survival 
assay. These clonal lines were expanded and expression 
in these selected cell lines was amplified by selection 
10 in 50 nM methotrexate (MTX). For selection in MTX, cells 
were grown to 50% confluence in a 150 cm^ flask in 
selection medium. The medium was changed to MEM alpha 
containing 50 nM MTX concentration ( it was not necessary 
to use G418 during MTX amplification). After placement 
15 in 50 nM MTX, the majority of cells died and colonies of 
resistant cells reappeared in 1-2 weeks. At this time, 
the cells were trypsinized to disperse colonies and are 
split when cells reach confluence. Cells eventually 
reached the same growth rate as before. The selected 
20 cells were screened for expression of recombinant 

protein. A 2-3 fold increase in expression was observed 
after selection in 50 nM MTX. Frozen stocks were kept 
for cell lines obtained from the original selection and 
the 50 nM MTX selection. Further selection could be 
25 continued in increasing MTX until desired levels of 
expression are obtained • 

Using the above method, we isolated cells 
identified as DG44CH05-3( G418 ) ( pCMV-NTN-3-1 ) and 
DG44CHO5-3(50nMMTX)(pCMV-NTN-3-l). Cells from the 
30 DG44CH05-3{ 50nMMTX) (pCMV-NTN-3-1 ) strain expressed levels 
of approximately 100 pg of biologically active protein 
per liter of conditioned media determined by direct assay 
of conditioned medium in SCG assay according to the 
methods in example 1 . 
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Example 9 

This example illustrates the expression of 
neurturin in various tissues. 

A survey of neurturin and GDNF expression was 
5 performed in rat embryonic tissues (ElO, day 10 after 
conception), neonatal tissues (PI, Postnatal Day 1), and 
adult tissues (> 3 mos ) using semi-quantitative RT/PCR 
(Estus et al., J Cell Biol 127:1717-1727, 1994 which is 
incorporated by reference ) . The RNA samples were 

10 obtained from various tissues and PGR products were 

detected either by autoradiography after incorporation of 
a-^^P-dCTP in the PGR and electrophoresis on a 
polyacrylamide gel ( Figure 6 ) or by ethidium bromide 
staining of DNA after electrophoresis on agarose gels 

15 (Tables 3 and 4). The neurturin fragment of 101 base 
pairs was obtained using the forward primer 
GAGGGACGAGGCGTGCGCAAAGAGCG ( SEQ ID NO: 67) and reverse 
primer TAGCGGCTGTGTACGTCCAGGAAGGACACCTCGT (SEQ ID NO: 68) 
and the GDNF fragment of 194 base pairs was obtained 

20 using the forward primer AAAAATCGGGGGTGYGTCTTA (SEQ ID 
NO: 69) and the reverse primer CATGCCTGGCCTACYTTGTCA (SEQ 
ID NO: 70). 

No neurturin or GDNF mRNA was detected at the 
earliest embryonic age (embryonic day 10, ElO) surveyed- 
25 In neonates (postnatal day 1, PI) both transcripts 

were expressed in many tissues although neurturin tended 
to show a greater expression in most tissues than did 
GDNF. (see table 3). 

30 
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Table 3. 



NEURTURIN GDNF 



Liver 


+++ 


- 


Blood 


+++ 


+ 


Thymus 


+ 




Brain 


++ 


+ 


Scia-tic 




+ 


nerve 






Kidney 


+ + 


+ + 


Spleen 


++ 


+ 


Cerebellum 


++ 


+ 


Heart: 


++ 


+ 


Bone 


+ 


+ 



15 

As shown in Table 3, differences in the tissue 
distributions of neurturin and GDNF were noted. In 
particular, no GDNF was detected in liver and thymus 
where neurturin expression was detected and no neurturin 
20 was detected in sciatic nerve where GDNF was detected. 

Neurturin and GDNF mRNA were detected in many 
tissues in the adult animal, but the tissue-specific 
pattern of expression for these two genes was very 
different, (table 4, Figure 5). 
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Table 4. 



NEURTURIN GDNF 





Li vex* 










-f 








+ 


+ + 




Brain 


+ 






nerve 






1 o 

A. \J 




H- + 


+ 








+ 




Cerebellum 


- 






Uterus 


+ + 


— 




Bone marrow 


+ + 




15 


Test:ls 


+ + 


+ + 




Ovary 


+ 


+ 




Placenta 


+ 






Skeletal 
muscle 






20 


Spinal cord 


+ 






Adrenal 
gland 


+ + 


+ + 




Gut 




+ + 



25 As shown in table 4, neurturin was found to be 

expressed in brain and spinal cord as well as in blood 
and bone marrow where no GDNF was detected. The level of 
expression of neurturin in brain and blood was, however, 
less than that detected in neonatal tissue. 

30 Neurturin was also highly expressed in freshly 

isolated rat peritoneal mast cells, whereas GDNF showed 
little or no expression. 

Example 10 

35 This example illustrates the preparation of 

antisera to neurturin by immunization of rabbits with a 
neurturin peptide. 
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The peptide sequence corresponding -to amino acids 
73-87 of the mature murine neurturin protein was 
synthesized and coupled to keyhole limpet hemocyanin 
(KLH) as described earlier (Harlow and Lane, Tintibodxes : 
5 a laboratory manual, 1988. Cold Spring Harbor Laboratory, 
New York, NY. p. 72-81 which is incorporated by 
reference). The KLH-coupled peptide was submitted to 
Caltag, Inc. and each of two rabbits were immunized. 
Immunization was by subcutaneous injection at 7-10 sites. 

10 The first injection was with 150 pg KLH-coupled peptide 
which was resuspended in 0.5 ml saline and emulsified 
with 0.5 ml complete Freund's adjuvant. Boost injections 
were begun 4 weeks after the initial injection and were 
performed once every 7 days as above for a total of 5 

15 injections except that 100 \xg of KLH-coupled peptide and 
incomplete Freund ' s adjuvant were used. Serum samples 
were collected 1 week after the fifth boost. 

A pooled volume of twenty ml of serum that had 
been collected from both rabbits one week after the 5th 

20 injection was purified. For purification, a peptide 
affinity column was prepared by coupling the above 
peptide to cyanogen bromide activated Sepharose 4B 
according to the manufacturers protocol ( Pharmacia 
Biotech). The serum was diluted 10 fold in 10 mM Tris pH 

25 7.5 buffer and mixed by gentle rocking for 16 hours at 

4*C with 0.5 ml of peptide agarose matrix containing 5 mg 
of coupled peptide. The matrix was placed into a column, 
washed with 5 ml of 10 mM Tris pH 7.5, 150 mM NaCl, 
washed with 5 ml of 10 mM Tris pH 7.5 buffer containing 

30 0.4 M NaCl and eluted with 5.5 ml of 100 mM glycine pH 
2.5 buffer. One tenth volume of l.OM Tris pH 8.0 buffer 
was added to the eluate immediately after elution to 
neutralize the pH. The glycine eluate was dialyzed 
overnight against 10 mM Tris pH 7.5, 150 mM NaCl. 

35 The affinity-purified antibodies were used in a 

v^estern blot to demonstrate specific recognition of 
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recombinant neurturin protein. Ten ml of conditioned 
medium collected from DG44CH05-3( G418 ) ( pCMV-NTN-3-1) 
cells was purified over SP Sepharose as described in 
Example 1 and the proteins electrophoresed on a reducing 
5 SDS-PAGE gel in the tricine buffer system ( Schagger and 
von Jagow Analytical Biochemistry 166:368-379, 1987). 
The proteins were electroblotted to a nitrocellulose 
membrane in 25 mM Tris, 192 mM glycine, 0,04% SDS, 17% 
methanol at 4**C for 16 hr. The membrane was incubated 

10 with the af f inity-purif ied ant i -neurturin peptide 

antibodies and then with horseradish peroxidase-coupled 
sheep anti-rabbit IgG (Harlow and Lane, supra, p. 
498-510). Bound antibodies were detected with enhanced 
chemi luminescence (ECL kit, Amersham, Buckinghamshire, 

15 England). The anti-neurturin antibodies recognized a 
single, approximately 11,5 kD protein band in the 
conditioned medium of the DG44CH05-3( G41B ) ( pCMV-NTN-3-1 ) 
cells. Using these anti-neurturin antibodies, neurturin 
protein could be detected in 10 ml of conditioned medium 

20 from DG44CH05-3( G418 ) ( pCMV-NTN-3-1) cells but could not 
be detected in 10 ml of medium conditioned with DG44 
cells that had not been transformed with the neurturin 
expression vector. 

25 Example 11 

The following example illustrates the 
identification of additional members of the 
GDNF/neurturin/persephin gene subfamily. 

The TGF-6 super family currently contains over 25 

30 different gene members (for review see Kingsley, Genes 

and Development 8: 133-146, 1994 which is incorporated by 
reference). The individual family members display 
varying degrees of homology with each other and several 
subgroups within the superfamily can be defined by 

35 phylogenetic analysis using the Clustal V program 

(Higgins et al, Comput Appl Bloscl 8: 189-191, 1992 which 
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is incorpora-ted by reference) and by bootstrap analysis 
of phylogenetic trees ( Felsenstein, Evolution 39:783-791, 
1985 which is incorporated by reference). Neurturin or 
persephin is approximately 40% identical to GDNF but less 
5 than 20% identical to any other member of the TGF-6 

super family. Several sequence regions in neurturin can 
be identified (Figure 5) that are highly conserved within 
the GDNF/neurturin/persephin subfamily but not within the 
TGF-6 superfamily. These conserved regions are likely to 

10 characterize a subfamily containing previously unisolated 
genes, which can now be isolated using the conserved 
sequence regions identified by the discovery and 
sequencing of the neurturin and persephin genes. Regions 
of high sequence conservation between neurturin, 

15 persephin and GDNF allow the design of degenerate 

oligonucleotides which can be used either as probes or 
primers. Conserved- region amino acid sequences have been 
identified herein to include Val-Xaai-Xaa2-Leu-Gly-Leu- 
Gly-Tyr where Xaa^ is Ser, Thr or Ala and Xaa^ is Glu or 

20 Asp (SEQ ID NO: 108); Glu-XaBj -Xaa2-Xaa3-Phe-Arg-Tyr-Cys- 

Xaa4-Gly-Xaa5-Cys in which Xaa^ is Thr, Glu or lys, Xaa2 is 
Val, Leu or lie, Xaa3 is Leu or lie, Xaa^ is Ala or Ser, 
and Xaa^ is Ala or Ser, (SEQ ID NO: 113); and Cys-Cys-Xaa^- 
Pro-Xaaj-Xaaa-Xaa^-Xaas-Asp-Xaa^-Xaa^-Xaag-Phe-Leu-Asp-Xaa, 

25 in which Xaa^ is Arg or Gin, Xaa2 is Thr or Val or lie, 
Xaa3 is Ala or Ser, Xaa^ is Tyr or Phe, Xaag is Glu, Asp 
or Ala, Xaa^ is Glu, Asp or no amino acid, Xaa^ is val or 
leu, Xaag is Ser or Thr, and Xaa^ is Asp or Val (SEQ ID 
N0:114)« Nucleotide sequences containing a coding 

30 sequence for the above conserved sequences or fragments 
of the above conserved sequences can be used as probes. 
Exemplary probe and primer sequences which can be 
designed from these regions are as follows. 
Forward primers, 

35 Primer A ( M3119 ) : 5 ' -GTNDGNGANYTGGGNYTGGGNTA ( SEQ 

ID NO: 115) 23 nt which codes for the amino acid 
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sequence, Val-Xaai-Xaaj-Leu-Gly-Leu-Gly-Tyr where 
Xaa^ is Thr, Ser or Ala and Xaaj is Glu or Asp 
(SEQ ID NO: 125); 

Primer B (M3123): 5 ^ -GANBTNWCNTTYYTNGANG (SEQ ID 
NO: 116) 19 nt which codes for the amino acid 
sequence^ Xaa^-Xaaz-Xaas-Phe-Leu-Xaa^-Xaas where 
Xaai is Asp or Glu, Xaa2 is Val or Leu, Xaa^ is Thr 
or Ser, Xaa4 is Asp or Glu, and Xaa^ is Asp or Val 
(SEQ ID NO: 126); 

Primer C (M3126): 5 ' -GANBTNWCNTTYYTNGANGW (SEQ ID 
NO: 117) 20 nt which codes for the amino acid 
sequence, Xaa^-Xaaj-Xaaa-Phe-Leu-Xaa^-Xaas where 
Xaa^ is Asp or Glu, Xaaz is Val or Leu, Xaa3 is Thr 
or Ser, Xaa4 is Asp or Glu, and Xaag is Asp or Val 
(SEQ ID NO: 126); 

Primer D (M3121): 5 » -TTYMGNTAYTGYDSNGGNDSNTG (SEQ 
ID NO: 118) 23 nt which codes for the amino acid 
sequence, Phe-Arg-Tyr-Cys-Xaai-Gly-Xaa2-Cys where 
Xaa^ is Ser or Ala and Xaaj is Ser or Ala (SEQ ID 
NO: 127); 

Primer E (M3122): 5 ' -GTNDGNGANYTGGGNYTNGG (SEQ ID 
NO: 119) 20 nt which codes for the amino acid 
sequence, Val-Xaai-Xaa2-Leu-Gly-Leu-Gly where Xaaj 
is Thr, Ser or Ala and Xaaj is Asp or Glu (SEQ ID 
NO: 128); and 

Primer F (M3176): 5 ' -GTNDGNGANYTGGGNYTGGGNTT (SEQ 
ID NO: 120) 23 nt which codes for the amino acid 
sequence, Val-Xaai-Xaaj-Leu-Gly-Leu-Gly-Phe where 
Xaa^ is Thr, Ser or Ala and Xaa2 is Glu or Asp 
(SEQ ID NO: 129 ) • 
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Reverse primers. 

Primer G (M3125): 5 ' -WCNTCNARRAANGWNAVNTC ( SEQ ID 
NO: 121) 20 nt whose reverse complementary sequence 
codes for the amino acid sequence, 
5 Xaai-Xaa2-Xaa3-Phe-Leu-Xaa4-Xaa5 where Xaa^ is Asp 

or Glu, Xaa2 is Val or Leu, Xaaj is Thr or Ser, 
Xaa^ is Asp or Glu, and Xaag is Asp or Val (SEQ ID 
NO: 126); 

10 Primer H (M3124): 5 ' -WCNTCNARRAANGWNAVNT (SEQ ID 

NO: 122) 19 nt whose reverse complementary sequence 
codes for the amino acid sequence, 
Xaai-Xaa2-Xaa3-Phe-Leu-Xaa4-Xaa5 where Xaaj is Asp 
or Glu, Xaa2 is Val or Leu, Xaaj is Thr or Ser, 

15 Xaa4 is Asp or Glu, and Xaag is Asp or Val (SEQ ID 

NO: 126); 

Primer I (M3120): 5 ' -CANSHNCCNSHRCARTANCKRAA (SEQ 
ID NO: 123) 23 nt whose reverse complementary 
20 sequence codes for the amino acid sequence, 

Phe-Arg-Tyr-Cys-Xaai-Gly-Xaa2-Cys where Xaa^ is Ser 
or Ala and Xaa2 is Ser or Ala (SEQ ID NO: 127); and 

Primer J ( M3 1 1 8 ) : 5 ' - CANSHNCCNSHRCART ANCKRAANA 
25 (SEQ ID NO: 124) 25 nt whose reverse complementary 

sequence codes for the amino acid sequence, 
Xaai-Phe-Arg-Tyr-Cys-Xaa2-Gly-Xaa3-Cys where Xaa^ 
is lie or Leu, Xaa2 is Ser or Ala and Xaa^ is Ser 
or Ala (SEQ ID NO: 130). 
30 In addition to the above, the following primers 

are based upon conserved regions in GDNF and neurturin 
(SEQ ID NOS: 33-35). 

Primer 1, GTNWSNGANYTNGGNYTNGGNTA (SEQ ID NO: 42) 
which encodes the amino acid sequence, Val-Xaa^- 
35 Xaa2-Leu-Gly-Leu-Gly-Tyr where Xaa^ is Ser or Thr 

and Xaa2 is Glu or Asp (SEQ ID NO: 33); 
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Primer 2, TTYMGNTAYTGYDSNGGNDSNTGYGANKCNGC ( SEQ ID 
NO: 43) which encodes amino acid sequence Phe-Arg- 
Tyr-Cys-Xaai-Gly-Xaa2"Cys-Xaa3-Xaa4-Ala where Xaa^ 
is Ala or Ser, Xaa2 is Ala or Ser, Xaa3 is Glu or 
Asp and Xaa4 is Ser or Ala (SEQ ID NO:36); 

Primer 3 reverse GCNGMNTCRCANSHNCCNSHRTANCKRAA 
(SEQ ID NO: 44) whose reverse complementary 
sequence encodes amino acid sequence Phe-Arg-Tyr- 
Cys-Xaai-Gly-Xaa2-Cys-Xaa3-Xaa4-Ala where Xaa^ is 
Ala or Ser, Xaa2 is Ala or Ser, Xaag is Glu or Asp 
and Xaa4 is Ser or Ala (SEQ ID N0:37); 

Primer 4 reverse TCRTCNTCRWANGCNRYNGGNCKCARCA (SEQ 
ID NO: 45) whose reverse complementary sequence 
encodes amino acid sequence Cys-Cys-Arg-Pro-Xaa^- 
Ala-Xaa2-Xaa3-Asp-Xaa4 where Xaa^ is lie or Thr or 
Val, Xaa2 Try or Phe, Xaag is Glu or Asp and Xaa^ 
is Glu or Asp (SEQ ID N0:38); 

Primer 5 reverse TCNARRAANSWNAVNTCRTCNTCRWANGC 
(SEQ ID NO: 46) whose reverse complementary 
sequence encodes amino acid sequence Ala-Xaa^- 
Xaa2-Asp-Xaa3-Xaa4-Ser-Phe-Leu-Asp where Xaa^ is 
Tyr or Phe, Xaa2 Glu or Asp, Xaaj is Glu or Asp, 
and Xaa4 is Val or Leu (SEQ ID NO:39); 

Primer 6 GARRMNBTNHTNTTYMGNTAYTG (SEQ ID NO: 47) 
which encodes amino acid sequence Glu-Xaai-Xaa2-- 
Xaa3-Phe-Arg-Tyr-Cys where Xaa^ is Glu or Thr, Xaaj 
is Leu or Val and Xaaj is lie or Leu (SEQ ID 
NO:40); 

Primer 7 GARRMNBTNHTNTTYMGNTAYTGYDSNGGNDSNTGHGA 
(SEQ ID NO: 48) which encodes amino acid sequence 
Glu'-Xaai-Xaa2-Xaa3-Phe-Arg-Tyr-Cys--Xaa4-Gly-Xaa5- 
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Cys-Xaa^ where Xaa^ is Glu or Thr, Xaa^ is Leu or 
Val^ Xaa3 is lie or Leu, Xaa4 is Ser or Ala, Xaag 
is Ser or Ala and Xaa^ is Glu or Asp (SEQ ID 
NO:41 ) . 

5 The above sequences can be used as probes for 

screening libraries of genomic clones or as primers for 
amplifying gene fragments from genomic DNA or libraries 
of genomic clones or from reverse transcribed cDNA using 
RNA templates from a variety of tissues. Genomic DNA or 
10 libraries of genomic clones can be used as templates 
because the neurturin, persephin and GDNF coding 
sequences for the mature proteins are not interrupted by 
introns . 

A degenerate oligonucleotide can be synthesized as 

15 a mixture of oligonucleotides containing all of the 
possible nucleotide sequences which code for the 
conserved amino acid sequence. To reduce the number of 
different oligonucleotides in a degenerate mix, an 
inosine or universal base ( Loakes et al. Nucleic Acids 

20 Res 22:4039-43, 1994) can be incorporated in the 

synthesis at positions where all four nucleotides are 
possible. The inosine or universal base forms base pairs 
with each of the four normal DNA bases which are less 
stabilizing than AT and GC base pairs but which are also 

25 less destabilizing than mismatches between the normal 
bases (l^e. AG, AC, TG, TC). 

To isolate family members a primer above can be 
end labeled with ^^P using T4 poJ.ynucleotide kinase and 
hybridized to libraries of human genomic clones according 

30 to standard procedures. 

A preferred method for isolating family member 
genes would be to use various combinations of the 
degenerate primers above as primers in the polymerase 
chain reaction using genomic DNA as a template. The 

35 various combinations of primers can include sequential 
PGR reactions utilizing nested primers or the use of a 
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forward primer paired with an oligo dT primer. In 
addition, one of the degenerate primers can be used with 
a vector primer, a single primer can be used in an 
inverted PGR assay or PGR can be performed with one 
5 degenerate primer and a random primer. As an example 
using the above set of primers, primer 2 (SEQ ID NO: 43) 
can be used with primer 4 (SEQ ID NO: 45) in PGR with 1 ug 
of human genomic DNA and cycling parameters of 94^*0 for 
30 sec, 50^*0 for 30 sec, and 72 *G for 50 sec. The above 

10 PGR conditions are exemplary only and one skilled in the 
art will readily appreciate that a range of suitable 
conditions and primer combinations could be used or 
optimized such as different temperatures and varying salt 
concentrations in the buffer medium and the like. It is 

15 preferred that DMSO be added to the PGR reaction to a 

final concentration of 5% inasmuch as this was found to 
be necessary for amplification of this region of the 
neurturin gene« The PGR reaction, when run on an agarose 
gel, should contain products in the size range of 100-150 

20 base pairs since a one amino acid gap is introduced in 
the neurturin sequence and a five amino acid gap is 
introduced in the persephin sequence when either sequence 
is aligned with GDNF, and thus family member genes might 
also contain a slightly variable spacing between the 

25 conserved sequences of primers 2 and 4. The PGR products 
in the range of 100-150 base pairs should contain 
multiple amplified gene products including GDNF, 
neurturin and persephin as well as previously unisolated 
family members. To identify sequences of these products, 

30 they can be gel purified and ligated into the Bluescript 
plasmid ( Stratagene ) , and then transformed into the 
XLl-blue E. Goli host strain (Stratagene). Bacterial 
colonies containing individual subclones can be picked 
for isolation and plated on nitrocellulose filters in two 

35 replicas. Each of the replicate filters can be screened 
with an oligonucleotide probe for either unique GDNF or 
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unique neurt:urin or unique persephin sequence in the 
amplified region. Subclones not hybridizing to either 
GDNF or neurturin or persephin can be sequenced and if 
found to encode previously unisolated family members, the 
5 sequence can be used to isolate full length cDNA clones 
and genomic clones as was done for neurturin ( Example 5 ) . 
A similar method was used to isolate new gene members 
(GDF-3 and GDF-9 ) of the TGF-B superfamily based on 
homology between previously identified genes (McPherron J 
10 B±ol Chem 268: 3444-3449, 1993 which is incorporated by 
reference ) . 

The inventors herein believe that the most 
preferred way to isolate family member genes may be to 
apply the above PGR procedure as a screening method to 
15 isolate individual family member genomic clones from a 

library. This is because there is only one exon for the 
coding region of both mature neurturin and GDNF, If, for 
example, the above PGR reaction with primers 2 and 4 
generates products of the appropriate size using human 
20 genomic DNA as template, the same reaction can be 

performed using, as template, pools of genomic clones in 
the PI vector according to methods well known in the art, 
for example that used for isolating neurturin human 
genomic clones ( Example 5 ) . Pools containing the 
25 neurturin gene in this library have previously been 

identified and persephin and GDNF-containing pools can be 
readily identified by screening with GDNF and PSP 
specific primers. Thus non-neurturin, non-persephin, 
non-GDNF pools which generate a product of the correct 
30 size using the degenerate primers will be readily 

recognized as previously unisolated family members. The 
PGR products generated from these pools can be sequenced 
directly using the automated sequencer and genomic clones 
can be isolated by further subdivision and screening of 
35 the pooled clones as a standard service offered by Genome 
Systems, Inc. 
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Example 12 

The following example illustirates t;he isolation 
aiid identification of persephin utilizing the procedures 
and primers described in Example 11, 
5 The degenerate PCR strategy devised by the 

inventors herein has now been successfully utilized to 
identify a third factor, persephin, that is approximately 
35-50% identical to both GDNF and neurturin. The 
experimental approach was described above and is provided 
10 in greater detail as follows. Primers corresponding to 
the amino acid sequence Val-Xaal-Xaa2-Leu-Gly-Leu-Gly-Tyr 
where Xaal is Ser or Thr and Xaa2 is Glu or Asp ( SEQ ID 
N0:33) tM1996; 5 ' -GTNWSNGANYTNGGNYTNGGNTA (SEQ ID NO: 42)] 
and Phe-Arg-Tyr-Cys-Xaal-Gly~Xaa2-Cys-Xaa3-Xaa4-Ala where 
15 Xaal is Ala or Ser, Xaa2 is Ala or Ser, Xaa3 is Glu or 
Asp and Xaa4 is Ser or Ala (SEQ ID NO: 37) [1X11999; 5'- 
GCNGMNTCRCANSHNCCNSHRCARTANCKRAA (SEQ ID NO : 44 ) ] were 
used to amplify a 77 nt fragment from rat genomic DNA 
using Klentaq enzyme and buffer under the following 
20 conditions: 94*^0 for 30 sec; 44**C for 30 sec; 72**C for 
30 sec for 40 cycles. The resulting product was 
subcloned into the Bluescript KS plasmid and sequenced. 
All nucleotide sequencing was performed using fluorescent 
dye terminator technology per manufacturer's instructions 
25 on an Applied Biosystems automated sequencer Model #373 
(Applied Biosystems, Foster City, CA). Plasmid DNA for 
sequencing was prepared using the Wizard Miniprep kit 
(Promega Corp., Madison, WI ) according to the 
manufacturer ' s instructions • 
30 The sequence of one of the amplified products 

predicted amino acid sequence data internal to the PCR 
primers that was different from that of GDNF or neurturin 
but had more than 20% identity with GDNF and neurturin, 
whereas the sequences of others we obtained corresponded 
35 to GDNF or neurturin, as would be expected. The novel 



BNSDOCID: <WO_991423SAl_L> 



wo 99/14235 



PCT/US98/19163 



100 

sequence was "thought: to identify a new member of this 

family which we named persephin* 

The sequence of this fragment internal to the 

primers was 5 ' -TGCCTCAGAGGAGAAGATTATC ( SEQ ID NO: 90). 

5 This encodes the last nucleotide of the Tyr codon, and 

then encodes the amino acids: Ala-Ser-Glu-Glu-Lys-Ile- 

Ile (SEQ ID N0:91)- This sequence was then aligned with 

the rat sequences of GDNF and neurturin. This analysis 

confirmed that persephin was unique. 

10 LGLGYETKEELIFRYC GDNF (rat) (SEQ ID NO: 92) 
LGLGYTSDETVLFRYC NTN ( rat ) ( SEQ ID NO : 93 ) 
LGLGYASEEKIIFRYC PSP (rat) (SEQ ID NO: 94) 

To obtain additional persephin sequence, primers 

15 containing portions of the unique 22 nt of the amplified 
fragment above were used in the rapid amplification of 
cDNA ends (RACE) technique (Frohman, M.A. Methods ±n 
Enzymology 218:340-356, 1993) using the Marathon RACE kit 
(CliONTECH, Palo Alto, CA ) per the manufacturer's 

20 instructions, except that first strand cDNA synthesis was 
carried out at 50 *C using Superscript II reverse 
transcriptase (Gibco-BRL). Briefly, a double stranded 
adaptor oligonucleotide was ligated to the ends of double 
stranded cDNA synthesized from postnatal day 1 rat brain 

25 mRNA. Using nested forward persephin PGR primers, 

(10135; 5' -AGTCGGGGTTGGGGTATGCCTCA, SEQ ID NO: 95 and 
M2026; 5 ' -TATGCCTCAGAGGAGAAGATTATCTT SEQ ID NO: 96) in 
combination with primers to the ligated adaptor supplied 
in the kit (API, AP2 ) , the 3' end of the persephin cDNA 

30 was amplified by two successive PGR reactions (1st: 

10135 and API, using 94 "C for 30 sec, 60 ""C for 15 sec and 
68**C for 2 min for 35 cycles; 2nd: M2026 and AP2 using 
94**C for 30 sec, 60 for 15 sec and 68**C for 2 min for 21 
cycles). An approximately 350 nt fragment was obtained 
35 from this PGR reaction and this fragment was directly 
sequenced using primer M2026. The sequence of this 3* 
RACE product resulted in a partial rat persephin cDNA 
sequence of approximately 350 nt (SEQ ID NO: 97). The 
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predicted amino acid sequence of this cDNA was compared 
-to that of GDNF and neurturin, and found to be 
approximately 40% homologous to each of these proteins. 
Importantly, the characteristic spacing of the cyteine 
5 residues in members of the TGF-p superfamily was present. 
Furthermore, in addition to the region of similarity 
encoded by the degenerate primers used to isolate 
persephin, another region of high homology shared between 
GDNF and neurturin, but absent in other members of the 
10 TGF-p superfamily, was also present in persephin 

GDNF ACCRPVAFDDDLSFLDD ( aa 60-76) ( SEQ ID NO: 98) 
NTN PCCRPTAYEDEVSFKDV ( aa 61-77) (SEQ ID NO: 99) 
PSP PCCQPTSYAD-VTFLDD ( aa 57-72) (SEQ ID NO: 100) 

15 

(Amino acid numbering uses the first Cys residue as ammo 
acid 1 ) . 

With the confirmation that persephin was indeed a 
20 new member of the GDNF/neurturin subfamily, we isolated 
murine genomic clones of persephin to obtain additional 
sequence information. Primers (forward, M2026; 5'- 
TATGCCTCAGAGGAGAAGATTATCTT , SEQ ID NO: 96 and reverse, 
M3028; 5' -TCATCAAGGAAGGTCACATCAGCATA, SEQ ID NO: 101) 
25 corresponding to rat cDNA sequence were used in a PGR 
reaction (PGR parameters: 94^=^0 for 30 sec, 55"C for 15 
sec and 72 for 30 sec for 35 cycles) to amplify a 155 
nt fragment from mouse genomic DNA which was homologous 
to rat persephin cDNA sequence. These primers were then 
30 used to obtain murine persephin genomic clones from a 
mouse 129 /Sv library in a PI bacteriophage vector 
(library screening service of Genome Systems, Inc., St. 
Louis , MO ) . 

Restriction fragments (3.4 kb Nco I and a 3.3 kb Bam 
35 HI) from this PI clone containing the persephin gene were 
identified by hybridization with a 210 nt fragment 
obtained by PGR using mouse genomic DNA with primers 
(forward, M2026; SEQ ID NO:96 and reverse, M3159; 5'- 
CCACCACAGCCACAAGCTGCGGSTGAGAGCTG, SEQ ID NO: 102) and PGR 
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parameters: 94**C for 30 sec, 55**C for 15 sec and 72°C for 
30 sec for 35 cycles. The Nco I and Bam HI fragments 
were sequenced and found to encode a stretch of amino 
acids corresponding to that present in the rat persephin 
5 RACE product, as well as being homologous to the mature 
regions of both neurturin and GDNF (Figure 11). 

When the amino acid sequences of murine GDNF, 
neurturin and persephin are aligned using the first 
cysteine as the starting point (which is done because 
10 alterations in the cleavage sites between family members 
creates variability in the segments upstream of the first 
cysteine), persephin (91 amino acids) is somewhat smaller 
than either neurturin (95 amino acids) or GDNF (94 amino 
acids). The overall identity within this region is about 
15 50% with neurturin and about 40% with GDNF (Figure 12). 
Further nucleotide sequencing of the murine 
persephin Nco I fragment revealed the nucleotide sequence 
of the entire murine persephin gene ( SEQ ID NO: 131; 
Figure 17A). An open reading frame extends from the 
20 sequence coding for an initiator methionine up to a stop 
codon at positions 244^246. However, somewhere in this 
sequence there is an apparent anomaly such that the 
sequence encoding the RXXR cleavage site (nucleotides at 
positions 257-268) and the sequence corresponding to the 
25 mature persephin protein (positions 269-556) are not co- 
linear with this open reading frame. Instead, a second 
reading frame encodes the cleavage site and the mature 
persephin • 

Additional sequencing of the rat persephin has also 
30 been performed. Rat genomic fragments were amplified by 
PGR using Klentaq and rat genomic DNA as a template. 
The forward primer #40266 ( 5 » -AATCCCCAGGACAGGCAGGGAAT; 
SEQ ID NO: 137) corresponding to a region upstream of the 
mouse persephin gene and a reverse primer M3156 (5*- 
35 CGGTACCCAGATCTTCAGCCACCACAGCCACAAGC, SEQ ID NO: 138) 

corresponding to a region within the mature rat persephin 
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sequence were used with the following parameters (95°C 
for 15 sec, 55**C for 15 sec, 68^C for 45 sec x 30 
cycles)- The amplified product was kinased with T4 
polynucleotide kinase, the ends were blunted with E. coli 
5 DNA polymerase I (Klenow fragment), and cloned into BSKS 
plasmid. 

Nucleotide sequencing was performed to establish the 
sequence of the entire rat persephin gene ( SEQ ID NO: 134; 
Figure 18A), An open reading frame was found to extend 

10 from the sequence coding for an initiator methionine up 
to a stop codon at positions 244-246 as was seen with 
murine persephin. As was also seen with murine 
persephin, an anomaly was found to occur between the 
sequence encoding the initiator methionine and that 

15 encoding the cleavage site for the mature rat persephin 
such that two cogent reading frames exist. Irrespective 
of this anomaly, mammalian cells were found to express 
persephin from either the murine or rat full length 
genomic sequence as illustrated below (see Example 14). 

20 To pursue the genesis of this anomaly, we prepared 

mammalian expression vectors for both murine and rat 
persephin. To construct the murine plasmid, a PI clone 
containing the murine persephin gene was used as a 
template in a PGR assay. Primers were designed such that 

25 the resulting fragment would contain the persephin gene 
extending from the initiator Methionine to the stop 
codon. The PGR reaction utilized a forward primer M3175 
[5' -TGGTGTGAGGATGGGTGGAGGAAGAGTTCGGA] and reverse primer 
M31 56 [ 5 * -CGGTACCCAGATCTTCAGCCACCACAGCCACAAGC ] . To 

30 construct the analogous rat plasmid, rat genomic DNA was 
used as a template in a PGR assay. The PGR reaction 
utilized a forward primer M3175 [5*- 

TGCTGTCACCATGGCTGCAGGAAGACTTCGGA] and reverse primer 
M3 1 5 6 [ 5 • - CGGT ACCC AGATCTTC AGCCACC AC AGCC ACAAGC ] . The 
35 amplified products were cloned into BSKS and sequenced to 
verify that the correct clone had been obtained. The rat 
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and murine persephin fragments were excised using Sma I 
and Hind III and cloned into a Asp718 (blunted) and Hind 
III sites of the mammalian expression vector pCB6 . 

COS monkey cells were transfected with either the 
5 rat or murine persephin expression vectors or the non- 
recombinant vector (pCB6) itself. Forty eight hr later 
the cells were lysed, the samples were loaded onto a 15% 
SDS-polyacrylamide gel, and the proteins were separated 
by electrophoresis. The proteins were then transferred 
10 to nitrocellulose by electroblotting . This 

nitrocellulose membrane was incubated with ant i -persephin 
antibodies (which we raised to mature persephin produced 
in bacteria from a pET plasmid) to detect the presence of 
persephin in the lysates. Lysates from cells transfected 
15 with either the rat or murine persephin expression 

vectors, but not the lysate from cells transfected with 
pCB6, contain high amounts of persephin. The size of the 
persephin detected was 10-15 kD, consistent with the size 
predicted for the processed (i.e. mature form of 
20 persephin). Conditioned media harvested from these cells 
also contained mature persephin. These results 
demonstrate that both the murine and rat persephin genes 
are capable of directing the synthesis of a properly 
processed persephin molecule. 
25 To pursue the mechanism by which this occurred, we 

Isolated RNA from cells transfected with either rat or 
murine persephin expression vector. RT/PCR analysis was 
performed using primers corresponding to the initiator 
Met and the stop codon. We detected two fragments: one 
30 corresponding to the predicted size of the persephin gene 
and the other somewhat smaller, suggesting that RNA 
splicing had occurred. We confirmed this with a number 
of other primer pairs. Both the large and small 
persephin fragments were cloned and sequenced. As 
35 expected, the larger fragment corresponded to the 

persephin gene. The small fragment corresponded to a 
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spliced version of persephin. A small 88 nt intron 
within -the pro-domain (situated 154 nt downstream of the 
start codon) had been spliced out. After this splicing 
event, the "frameshift" was no longer present (i.e. the 
5 initiator Met and the mature region are in- frame) in 

either rat or mouse persephin (see Figures 17B and 18B). 

Example 13 

This example illustrates the preparation of a 

10 bacterial expression vector for murine persephin and its 

introduction into an E. Coll for expression of 

recombinant mature persephin. 

The persephin polynucleotide encoding the mature 

murine persephin protein which begins 5 amino acids 

15 upstream of the first framework Cys residue ( SEQ ID 

NO: 80) was cloned into the pET expression vector pET-30a 

at the Nde I and Bgl II sites. This persephin 

polynucleotide was generated by PGR using the murine 

persephin PI genomic clone as a template. A forward 

20 primer M3157 (5'- 

GGACTATCATATGGCCCACCACCACCACCACCACCACCACGACGACGACGACAAGGC 

CTTGGCTGGTTCATGCCGA, SEQ ID NO: 139) encoding an Nde I 
site, 8 histidine residues, and an enterokinase site, and 
a reverse primer M3156 (5'- 
25 CGGTACCCAGATCTTCAGCCACCACAGCCACAAGC, SEQ ID NO: 138), 
which corresponds to the sequence encoding the last 6 
amino acid residues of the mature persephin sequence, the 
stop codon and a Bgl II site, were used. The PGR 
reaction conditions were 95°G for 15 sec, 55°C for 15 
30 sec, 68*^0 for 60 sec x 25 cycles. This PGR product was 
subcloned into the EcoRV site of BSKS plasmid and 
sequenced to verify that it contained no mutations. The 
persephin sequence was then excised from this vector 
using Nde I and Bgl II and cloned into the Nde I (5' ) and 
35 Bgl II (3' ) sites of the bacterial expression vector 
pET30a (Novagen, Madison, WI ) . This expression vector 



BNSDOCID: <WO ^9914235A1J_> 



wo 99/14235 PCT/US98/19163 



106 

would, -therefore, produce the mature form of the 
persephln protein possessing an amino terminal tag 
consisting of 8 histidine residues followed directly by 
an enterokinase site. 
5 The plasmid was introduced into E.coli strain BL21 

(DE3). To produce persephin, bacteria harboring this 
plasmid were grown for 16 hr, harvested, and lysed using 
6M guanidine-HCl, 0.1 M NaHjPO^, 0.01 M Tris at pH 8.0, 
and recombinant persephin protein was purified from these 
10 lysates via chromatography over a Ni-NTA resin (Qiagen). 
The protein was eluted using 3 column volumes of Buffer E 
containing 8 M urea, 0.1 M NaHjPO^, 0.01 M Tris, at pH 
4.5. The persephin was then renatured by dialysis in 
renaturation buffer consisting of 0.1 M NaH2P04, 0.01 M 
15 Tris at pH 8.3, 0.15 M NaCl, 3 mM cysteine, 0.02% Tween- 
20, 10% glycerol and containing decreasing concentrations 
of urea beginning with 4 M for 16 hr, followed by 2 M for 
16 hr, IM for 72 hr, and 0.5 M for 16 hr. The persephin 
concentration was then determined using a Dot Metric 
20 assay (Geno Technology, St. Louis, MO) and stored at 4^C. 

This bacterially produced recombinant persephin was 
used as an immunogen in rabbits to produce antibodies to 
mature persephin. All of the immunogen injections and 
blood drawing were performed at Cal Tag Inc. (Healdsburg, 
25 CA). The ant i -persephin antiserum was demonstrated to 
specifically recognize persephin, but not neurturin or 
GDNF, using protein blot analysis. This persephin- 
specific antiserum was then used to detect persephin in 
lysates prepared from transfected COS cells. 

30 

Example 14 

This example illustrates the preparation of 
mammalian expression vectors containing the murine or rat 
persephin genes and their incorporation into mammalian 
35 cell lines for the production of mature persephin. To 
construct the murine plasmid, a PI clone containing the 
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murine persephin gene was used as a template in a PGR 
assay- Primers were designed such that the resulting 
polynucleotide would contain the persephin gene extending 
from the initiator Methionine codon to the stop codon 3 ' 
5 to the mature persephin coding sequence ( SEQ ID NO: 131). 
The PGR reaction utilized a forward primer M3175 (5'- 
TGGTGTGAGGATGGGTGGAGGAAGAGTTGGGA, SEQ ID NO: 140) and 
x^everse primer M3156 (5'- 

CGGTACCCAGATCTTCAGCCACCACAGCCACAAGC, SEQ ID NO: 138). To 
10 construct the analogous rat plasmid, rat genomic DNA was 
used as a template in a PGR assay. The PGR reaction 
utilized a forward primer M3175 (5'- 

TGCTGTCACCATGGCTGCAGGAAGACTTCGGA, SEQ ID NO: 140) and 
reverse primer M3156 (5'- 

15 CGGTACCCAGATCTTCAGCCACCACAGCCACAAGC, SEQ ID NO: 138). 

Both PGR reactions were carried out using Klentaq and the 
following parameters: 95 for 15 sec, 55**C for 15 sec, 
68**C for 45 sec x 25 cycles. The amplified products were 
kinased with T4 polynucleotide kinase, the ends were 

20 blunted with E. coli DNA polymerase I (Klenow fragment), 
and cloned into BSKS plasmid. Nucleotide sequencing was 
performed to verify that the correct clone was obtained- 
The rat and murine persephin polynucleotides were excised 
using Sma I and Hind III and each cloned into a Asp718 

25 (blunted) and Hind III sites of the mammalian expression 
vector pCB6 . 

COS monkey cells were transfected with either the 
rat or murine persephin expression vectors (16 yxg per 5 x 
10^ cells) or the non- recombinant vector ( pCB6 ) itself 

30 using the calcium phosphate precipitation method (Chen 
and Okayama, Mol Cell Biol 7:2745-2752, 1987 which is 
incorporated by reference ) . Forty eight hr later the 
cells were lysed in IP buffer containing 50 mM Tris at pH 
7.5, 300 mM NaCl, 1% Triton X-100, 1% deoxycholate , 10 mM 

35 EDTA, 0.1% SDS, 5 pg/ml leupeptin, 7 pg/ml pepstatin, and 
250 pM PMSF. The samples were loaded onto a 15% SDS- 
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polyacrylamide gel and the proteins were separated by 
electrophoresis. The proteins were then transferred to 
nitrocellulose by electroblotting • This nitrocellulose 
membrane was incubated with anti-persephin antibodies to 
5 detect the presence of persephin in the lysates . 

As is shown in Figure 19, lysates from cells 
transfected with either the rat or murine persephin 
expression vectors, but not the lysate from cells 
transfected with pCB6, contain high amounts of persephin. 

10 The size of the persephin detected was approximately 14 
kD which is consistent with the size predicted for the 
processed, i.e. mature form of persephin. This 
demonstrates that both the murine and rat persephin genes 
are capable of directing the synthesis of a properly 

15 processed persephin molecule. 

Example 15 

The following example illustrates the isolation and 
identification of human persephin. 

20 In order to identify the human homologue of 

persephin or additional members of the GDNF family, 
degenerate PCR primers were designed based on the human 
Neurturin and GDNF sequences and used to amplify human 
genomic DNA. The following primers were used (SEQ ID 

25 NOS : 225-228 ) : 



DhNeurturinl ( DNl ) GTSASYGASYTGGGYCTGGGCTAY REF : B-46Z 

DhNeurturin2 ( DN2 ) TTYMGSTACTGCRSMGGCKCYTGC REF : B-46X 

DhNeurturin3r (DN3) RWAGGCSRTSGGKCKGCARCAKGS REF:B-46V 

30 DhNeurturin4r (DN4) MKCRTCYARRAASGACASSTC REF:B-46W 



Human genomic DNA (Clontech 6550-1 O.lpg/pl) was 
amplified with all 4 possible primer combinations (DNl- 
DN3r, DNl-DN4r, DN2-DN3r, DN2-DN4r). Reaction mixtures 
35 contained 5 vil of lOx Klentaq buffer, 0.5 vil dNTP (20mM); 
1 pi human genomic DNA, 0.5 \il Klentaq (Clontech) and 1.5 
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Vil of each primers (0.1 OD/pl ) in a total volume of 50 
pi. The DNA was amplified by touchdown PGR on Parkin 
Elmer Gene AMP 9600 in the following conditions: initial 
denaturation at 98"C for 2', then 5 cycles [98°C 30", 
5 72"C 1.5'], 5 cycles [98°C 30", 70"C 1.5'] and 25 cycles 
[98°G 30', 68 ''C 1.5'] followed by a last extension step 
at 68**C for 5 ' . 

PGR products of approximately 130 to 200 bp were 
identified after electrophoresis on agarose gel, purified 
10 and cloned into pCR 2.1 vector using In Vitrogen TA 

cloning kit (Gat # K2000-01 ) . Glones containing a 130- 
200bp insert (after EcoRl digestion) were sequenced and 
one of them (clone A3) obtained with the primer pair DNl- 
DN3r had a sequence homologous to mouse persephin and 
15 corresponded to human persephin. 

The partial sequence of the human persephin genomic 
DNA in clone A3 is shown below ( SEQ ID NO: 229): 
GGGGTTGTGAGGGAGGTGGGGGTGGGGTAGGCCTCAGAGGAGAAGGTGATGTTGGGC 
TACTGCGCCGGCAGCTGCCCCCGTGGTGCCCGCACCCAGCATGGCCTGGCGCTGGCC 
20 CGGCTGCAGGGCCAGGGCCGAGCCCACGGCGGGCCCTGCTGCCGCCCCATGGCC 

In order to identify a source from which to isolate 
a full length cDNA clone of human persephin, cDNA 
libraries were screened by PCR using exact match primers 
designed based on the genomic DNA sequence described 
25 above. Two sets of human persephin specific primers were 
prepared (SEQ ID NOS: 230-233 ) , 

hPSP-5 ' • 1 GAGGAGAAGGTCATCTTCCG REF : B-95K 

hPSP-3 ' . 1 GCCGTGGGCTCGGCCCTGGC REF : B-95L 

and 

30 hPSP-5 ' . 3 AGAGGAGAAGGTCATCTTCCGCTA REF : C-62Y 

hPSP-3 ' . 4 CTCGGCCCTGGCCCTGCAGC REF: C-62X 

and used to amplify single stranded DNA from pRK5 cDNA 
libraries ( Ipl of 200 ng/pl ) or the Stratagene's 
35 Quickscreen panel ( 3yl of each library) using the same 
conditions as described above. PCR products of the 
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expected size (108 bp for hPSP-5'1 with hPSP-3'.l and 101 
bp for hPSP-5'.3 with hPSP-3'.4) were detected in fetal 
lung, fetal liver, fetal kidney, small intestine, retina, 
cerebellum and hT+13 Lymphoblast. 
5 To isolate cDNA clones encoding human persephin, 

pRK5 libraries from human tissues were enriched for 
persephin cDNA clones by extension of single stranded DNA 
from plasmid libraries grown in a dut'/ung'-host using 
either of the following primers ( SEQ ID NOS : 233-234 ) : 

10 

hPSP-3 ' . 2 TGCAGCCGGGCCAGCGCCAG REF: D-68T 

hPSP-3 • . 4 CTCGGCCCTGGCCCTGCAGC REF : C-62X 

in a reaction containing lOpl of lOx PGR Buffer { Perkin 
15 Elmer), 1^1 dNTP (20mM), library DNA (200 ng ) , 0.5yl 

primer, 86.5ial HjO and Ipl of Amplitaq (Perkin Elmer) 
added after a hot start. The reaction was denatured for 
1 min at 95**C, annealed for 1 min at 50, 60 or 68 ^'C then 
extended for 20 min at 72^0. DNA was extracted with 
20 phenol /chloroform, ethanol precipitated, then transformed 
by electroporation into DHIOB host bacteria. 

Approximately 40,000 colonies from each 
transformation were lifted on nylon membranes and 
screened with a DNA probe derived from the sequence of 
25 clone A3 (described above). The fragment was labeled by 
the random oligonucleotide method using [32P]-dCTP. 
Filters were hybridized overnight at 42'*C in 50% 
formamide, 5xSSC, lOxDenhardt ' s, 0.05M sodium phosphate 
(pH 6.5), 0-1% sodium pyrophosphate, 50 pg/ml of 
30 sonicated salmon sperm DNA. Filters were then rinsed in 
2xSSC and washed in 0-lxSSC, 0.1% SDS then exposed 
overnight to Kodak X Ray films. Pure positive clones 
were obtained after secondary screening and the isolated 
clones were then sequenced. Human persephin clones were 
35 isolated from fetal lung, fetal kidney and fetal liver 

libraries. All such persephin clones isolated belong to 
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two categories: unspliced (10 clones) or chimeric (6 
clones). Unspliced clones were -900bp long and contain a 
region encoding a fragment corresponding to human 
persephin. However there is no initiation methionine and 
5 signal peptide present in that reading frame (Frame +2), 
A potential upstream initiation codon (ATG) is present in 
another reading frame (+1) and is followed by a 
hydrophobic sequence corresponding to a potential signal 
peptide. This suggests that such cDNAs are incompletely 
10 spliced and that an intron remains between the exons 
encoding the signal peptide and the persephin protein. 
Consensus splice donor and acceptor sequences can 
actually be identified at position 340 and 425, 
respectively. Splicing of an intron located between 
15 these positions would lead to a cDNA where the persephin 
coding sequence is "in frame" with the initiation 
methionine. Interestingly, aberrant chimeric clones were 
identified resulting from the joining of a cDNA encoding 
the predicted exon 2 of persephin exactly at the splice 
20 acceptor site present at position 425. The corresponding 
transcript was probably generated by aberrant splicing 
but confirms the presence of a splice acceptor site at 
position 425. 

As an alternative approach to isolate a human 
25 persephin cDNA clone, 3 million clones of a human 

cerebellum cDNA library in lambda ZAP ( Stratagene cat 
#935201 ) were screened with a DNA probe corresponding to 
clone A3 labeled by the random oligonucleotide method 
using [32P]-dCTP. The library was screened under high 
30 stringency hybridization conditions. The filters were 

prehybridized for 2h then hybridized overnight at 42^*0 in 
50% formamide, 5xSSC, lOxDenhardt ' s, 0.0 5M sodium 
phosphate (pH 6.5), 0.1% sodium pyrophosphate, 50 pg/ml 
of sonicated salmon sperm DNA. Filters were then rinsed 
35 in 2xSSC and washed once in O.lxSSC, 0.1% SDS at eO'^C. 
Filters were exposed overnight to Kodak X Ray films. 
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Four posit:ive clones ( Cere 1.1, 1.2, 6.1, 6.2) were 
picked and plaque purified. The plasmid contained within 
the lambda ZAP phage arms was rescued as described per 
manufacturer's instructions using Ex Assist helper phage. 
5 Sequencing of the four clones indicated that these clones 
were siblings and contained 2 silent mutations when 
compared to the human persephin clones isolated from the 
pRK5 library described above. These silent mutations 
occur at positions 30 (T->C) and 360 (T->C) of the sequence 
10 shown in Figure 24. Direct sequencing of the human 

persephin gene revealed that these silent mutations are 
actual allelic variations in the gene. 

In order to determine if the correct protein could 
be expressed from the unspliced cDNA identified above, 
15 constructs were generated where the sequence encoding a 
Flag tag was inserted just before the stop codon present 
at position 685 (frame + 1) or at position 746 (frame +2) 
starting from an ATG codon present at position 46 or 193. 
All four possible constructs were generated by PGR using 
20 the following primers ( SEQ ID NOS : 235-238 ) : 

hPSPlstMet.F 5'CGC GGA TCC ATG CCT GGA TTC 

GAG GGT GCA G 3' REF:B-127R 
hPSP2ndMet.F 5' CGC GGA TCC ATG GCC GTA GGG 

25 AAG TTC CTG C 3' REF:B-127S 

hPSP.FLAG.R 5' CTC CCA AGC TTT TAC TTG TCA 

TCG TCG TCC TTG TAG TCG CCA CCA 
CAG CCG CAG GCA GCC 3' REF:A-120C 
hPSP.sig.FLAG.R 5' CTC CCA AGC TTT TAC TTG TCA 

30 TCG TCG TCC TTG TAG TCT CGA GGA 

AGG CCA CGT CGG TG 3' REF:A-120B 

In all four PCR reactions, the Cere 1.2 clone was 
used as template and amplified with Pfu polymerase on a 
35 Stratagene Robocyler gradient cycler 96. PCR conditions 
were 95"C for 2', 30 cycles of [95"C 30", 1' at 52, 56, 
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60 or 63 ^'C, 72 °C for 2 min] followed by a last extension 
of 5 min at 72 ^C, 

Forward primers have a Bam HI site and the reverse 
primers have Hind III restriction site. The PGR products 
5 digested with Bam HI and Hind III and were subcloned into 
these sites in pRK5. DNA from each of the construct was 
transfected overnight into 293 cells using CaP04 method. 
Serum containing media was conditioned for 24h then 
harvested. Cells were also harvested and divided in two; 
10 1/4 of each plate for RT-PCR and the remaining 3/4 of 
each plate for immunoprecipitation. 

Analysis of expressed proteins was performed by 
immunoprecipitation. The cell pellet was lysed in 1 ml 
of lysis buffer (50 mM Tris pH 8.0, 150 mM NaCl, 1 mM 
15 EDTA, 1% NP40, Aprotinin, Leupeptin, PMSF, 1 mM NaF and 1 
mM Sodium Vanadate) for 20 min at 4**C. The extract was 
spun for 10 min at lOK rpm and then the supernatant was 
transferred to a new tube and precleared with 20 \xl 
Protein A Sepharose for Ih. From here, 1 ml of the 
20 conditioned media was processed in parallel. The protein 
A sepharose was spun down and 1 \il of anti-Flag antibody 
(3.6 pg) was added to each tube. After overnight 
incubation at 4*C, 30 pi of Protein G sepharose were 
added and the tubes incubated at 4"C for 1 hour. The 
25 protein G beads were then spun down for 1 min, washed 3 
times with lysis buffer, resuspended in 20 pi of Laemli 
buffer in the presence of p-mercapto-ethanol . Samples 
were denatured for 5 min at 100 ""C then loaded on a 16% 
polyacrylamide gel. Proteins were then transferred to 
30 nitrocellulose and analyzed by Western blot using the 

same anti-Flag antibody overnight at Ipg/ml in blocking 
buffer (PBS + 0.5% tween + 5% nonfat dry milk +3% Goat 
serum). Following this an anti-mouse HRP.ECL was used 
for the detection and the membrane was exposed for 90 sec 
35 to X-Ray film. 
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A specific band of 16 kDa was detected in the cell 
pellet of cells transfected with the construct starting 
at ATG 193 of frame +1 and with the flag inserted in 
frame +2 and a specific band of approximately 10 kDa 
5 could be detected in the corresponding supernatant. No 
Flag tagged protein could be detected in any other 
transfection or in mock transfected cells* 

The correctly spliced mRNA was identified by RT-PCR 
as follows. Total RNA was extracted from the transfected 
10 cells (1/4 of each pellet) using RNAzol B (Tel-Test Inc.) 
and treated for 40 min at 37 °C with DNase. RNA was then 
purified on an RNAasy column ( Pr omega ) and collected in a 
final volume of 50vil. First strand cDNA was synthesized 
on 4 pi RNA using superscript RT (GIBCO-BRL) for Ih at 
15 37 then 5' at 95** to inactivate. 

Two \xl of each RT reaction were then used as 
template for amplification by PGR in the presence of the 
following 2 primers ( SEQ ID NOS:236 and 239): 

20 

hPSP2ndMet . F 5 ' CGCGGATCCATGGCCGTAGGGAAGTTCCTGC 3 ' 

REF:B-127S 

hPSP . stop . R TCAGCCACCACAGCCGCAGGCAGCC REF : D- 103N 

25 on a Stratagene Robocyler gradient cycler 96. PGR 

conditions were 98"G for 1.5\ 28 cycles of [98"G for 
30", anneal 1' between 60^0 and 76**G, 72**C for 1.5'] 
followed by a last extension of 5 min at 72**C. 

Analysis of the PGR product on agarose gel indicates 

30 that PGR using the pRK5 .hPSP-FLAG. 2 plasmid as template 
gave the expected product of about 570 bp while the RT 
PGR product, using RNA from cells transfected with this 
construct as template, was smaller than 500 bp. PGR 
product from the latter reaction was subcloned into the 

35 pGR 2.1 vector using In Vitrogen TA cloning kit (cat 
#K2000-01 ) and sequenced. Sequence analysis revealed 



BNSDOCID: <WO_991423SA1J_> 



wo 99/14235 



PCT/US98/19163 



115 

that: the predicted 84 bp intron has been spliced out of 
the transcript . 

In summary, the human persephin cDNA has a 471 bp 
open reading frame encoding a 156 amino acid long protein 
5 (predicted Mr 16.6kDA), Cleavage of the 23 amino acid 
long predicted signal peptide will lead to a 133 amino 
acid pro-persephin molecule (Mr 14.2 kDa ) ; proteolytic 
cleavage of the pro-persephin at a RXXR consensus 
sequence should yield a 96 amino acid mature protein with 

10 a molecular weight of 10.3 kDa. This predicted size 
corresponds to the size of the Flag- tagged protein 
immunoprecipitated from the conditioned media of 
transfected 293 cells. Furthermore, amino terminal 
sequencing of Flag- tagged persephin purified from 

15 conditioned media of 293 transfected cells confirmed that 
the first residue of the mature form is Ala 61. 
Alignment between human persephin and human Neurturin 
indicates 38% similarity between the two molecules ( 50% 
for the mature region) and that human persephin is 30% 

20 similar to human GDNF (40% in the mature region). 

Example 16 

This example illustrates the preparation of chimeric 
or hybrid polypeptide molecules that contain portions 

25 derived from persephin (PSP) and portions derived from 
neurturin ( NTN ) . 

As closely related members of the TGFp family, each 
of persephin and neurturin is predicted to have a very 
similar overall structure, yet while neurturin promotes 

30 the survival of sympathetic neurons, the closely related 
persephin does not. Two chimeras were produced by 
essentially replacing portions of persephin with 
neurturin, with the crossover point located between the 
two adjacent, highly conserved third and fourth cysteine 

35 residues. The first chimera, named PSP/NTN (SEQ ID 
NO: 141, Figure 20), contains the first 63 residues of 
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mature murine persephin combined with residues 68 -through 
100 of mature murine neurturin (using E. coli preferred 
codons). To construct: this molecule, two PGR reactions 
were performed: 1) using the forward primer M2012 (5'- 
5 TAATACGACTCACTATAGGGGAA, SEQ ID NO: 142) and reverse 
primer M2188 ( 5 ' - 

TCGTCTTCGTAAGCAGTCGGACGGCAGCAGGGTCGGCCATGGGCTCGAC , SEQ ID 
NO: 143) and the pET30a-murine persephin plasmid as 
template (see Example 13); and 2) using the forward 
10 primer M2190 ( 5 ' -TGCTGCCGTCCGACTGCTTACGAAGACGA, SEQ ID 
NO: 144) and reverse primer M2186 (5'- 

GTTATGCTAGTTATTGCTCAGCGGT , SEQ ID NO: 145) and the pET30a- 
murine (E.coli preferred codons) neurturin plasmid as 
template ( see Example 6 ) . Both PGR reactions were 
15 carried out using the following parameters: 94-G for 30 
sec, 55 "^G for 30 sec, 72 for 30 sec x 25 cycles. The 
products of these two PGR reactions were gel purified, 
mixed together, and a PGR reaction was performed under 
the following conditions: 94 for 30 sec, 60 "G for 20 
20 min, 68''G for 5 min. After 8 cycles, an aliquot of this 
reaction was used as template in a third PGR reaction 
using the forward primer M2012 and reverse primer M2186 
under the following conditions: 94 'G for 30 sec, 55**C for 
30 sec, 72 '•G for 30 sec x 25 cycles. The resulting 
25 product was kinased with T4 polynucleotide kinase, the 
ends were blunted with E. coli DNA polymerase I (Klenow 
fragment), and cloned into BSKS plasmid. Nucleotide 
sequencing was performed to verify that the correct clone 
was obtained. The PSP/NTN fragment was excised using Nde 
30 I and Bam HI and cloned into the corresponding sites of 
the bacterial expression vector pET30a. 

The second chimera, named NTN/PSP (SEQ ID NO: 146, 
Figure 20), encodes the converse molecule. It contains 
the first 67 residues of mature murine neurturin (using 
35 E.coli preferred codons) combined with residues 64 to 96 
of mature murine persephin • To construct this molecule. 
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we performed two PGR reactions: 1) using the forward 
primer M2012 and reverse primer M2183 (5'- 

CACATCAGCATAGCTGGTGGGCTGGCAGCACGGGTGAGCACGAGCACGTT, SEQ 
ID NO: 147) and the pET30a-murine (E.coli preferred 
5 codons) neurturin plasmid as template; and 2) using the 
forward primer M2187 ( 5 ' -TGCTGCCAGCCCACCAGCTATGCTG, SEQ 
ID NO: 148) and reverse primer M2186 (5'- 

GTTATGCTAGTTATTGCTCAGCGGT , SEQ ID NO: 145) and the pETSOa- 
murine persephin plasmid as template. Both PGR reactions 

10 were carried out using the following parameters: 94*'C for 
30 sec, 55^G for 30 sec, 72"G for 30 sec x 25 cycles. 
The products of these two PGR reactions were used to 
construct the final NTN/PSP pET30a plasmid as detailed 
above for PSP/NTN except that Bgl II was used instead of 

15 Bam HI. These chimeric proteins were produced in E.coli 
and purified by Ni-NTA chromatography as described above 
( Example 13 ) . 

The purified proteins were assayed for their ability 
to promote survival in the SGG sympathetic neuron assay. 

20 The NTN/PSP protein did not promote survival, whereas the 
PSP/NTN protein promoted the survival of sympathetic 
neurons similar to that observed for neurturin itself. 
These results indicate that neurturin residues lying 
downstream of the 2 adjacent, highly conserved cysteine 

25 residues are critical for activity in promoting survival 
in SGG sympathetic neurons. In contrast, the 
corresponding residues of persephin are not sufficient 
for promoting survival in sympathetic neurons. 

30 Example 17 

This example illustrates the neuronal survival 
promoting activity of persephin in mesencephalic cells. 

The profile of survival promoting activity of 
persephin is different from that of neurturin and GDNF. 
35 In contrast to the survival promoting activity produced 
by neurturin and GDNF in sympathetic and sensory neurons. 
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persephin showed no survival promoting act:ivit:y in these 
tissues. We further evaluated the neuronal survival 
promoting activity of persephin in mesencephalic cells. 
Timed-pregnant Sprague-Dawley rats were purchased 
5 from Harlan Sprague-Dawley . The mesencephalon was taken 
from rats measuring 1.2 to 1.4 cm in length and time 
dated to be embryonic day 14. The cranium was removed 
and the entire mesencephalon was placed in cold LI 5. The 
pooled mesencephalic tissue was resuspended in a serum- 

10 free medium consisting of DME/Hams F12 (#11330-032, Life 
Technologies) 1 mg/ml BSA, Fraction V (A-6793, Sigma 
Chemical Co.,), 5 yM Insulin (1-5500, Sigma), 10 nM 
progesterone {P0130, Sigma), 100 \iM putrescine, (p7505, 
Sigma), 30 nM Selenium (S07150, Pflatz & Bauer), 10 ng/ml 

15 rat transferrin (012-000-050, Jackson Chrompure), 100 
U/ml penicillin, and 100 U/ml of streptomycin. The 
pooled mesencephalic tissues were triturated 
approximately 80 times using a bent-tip pipette and the 
cells were plated in a 24-well dish ( CoStar ) at a density 

20 of 15,000 cells in a 100-pl drop. The dishes were coated 
with 125 ng/ml poly-d-lysine (p-7280, Sigma) and 25 ng/ml 
laminin (#40232, Collaborative Biomedical Products). 
These dissociated cells were allowed to attach for 2 
hours at 37**C in 5% COj and then fed with another 500 pi 

25 of the above serum-free medium with or without 

approximately 100 ng/ml of recombinant Persephin. These 
cells were photographed after 3 days of culture. 

Inspection of the cells over the course of 3 days in 
culture, showed a gradual decrease in cell number. In 

30 the absence of any growth factor, almost all of the cells 
were dead (Figure 21A), In the presence of persephin, a 
large increase in mesencephalic neuronal cell survival 
was evident ( Figure 218 ) . 

This study was repeated to obtain comparative 
35 effects on mesencephalic cell for persephin and the 
related growth factors, neurturin and GDNF. 
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Mesencephalic tissue was removed from E14 par pps, pooled 
and dissocia-ted in dispase for 30 min. Cells were then 
triturated and plated in supplemented N2 media and plated 
at a density of 20,000 cells per well in an 8-well 
5 chamber slike. The cells in a given well were either 

untreated or treated with a growth factor at 50 ng/ml for 
four days. Cells were washed once with PBS, fixed with 
4% paraformaldehyde for 30 minutes and stained with 
tyrosine hydroxylase (TOH) antibody (Chemicon, ABC- 

10 Vectastain kit) and counted. TOH staining served as a 
marker for dopaminergice cells inasmuch as TOH is a 
synthetic enzyme for dopamine. Figure 22 shows the mean 
cell counts for untreated and treated cells. Persephin 
(PSP), neurturin ( NTN ) and GDNF promoted survival of 

15 mesencephalic neuronal cells to a comparable extent. 

Example 18 

This example illustrates the expression of persephin 
in various tissues. 

20 A survey of persephin expression was performed in 

adult mouse tissues using semi-quantitative RT/PCR (see 
Example 9). Poly A RNA was isolated from brain, 
cerebellum, kidney, lung, heart, ovary, sciatic nerve, 
dorsal root ganglia, blood and spleen. This was then 

25 reverse transcribed to produce cDNA (see Kotzbauer et al. 
Nature 384:467-470, 1996 which is incorporated by 
reference). The PCR primers used were as follows: 
forward primer: 5 ' -CCTCGGAGGAGAAGGTCATCTTC ( SEQ ID 
NO: 149) and reverse primer: 5 ' TCATCAAGGAAGGTCACATCAGCATA 

30 (SEQ ID NO: 101). PCR was done for 26 cycles with an 
annealing temperature of 60*'C. To control for the 
presence of genomic DNA, RNA samples which were not 
reverse transcribed were used for PCR (for example, the 
tissue control shown in figure 22 is labeled "Kidney no 

35 RT"). All the samples were found to be without genomic 
DNA contamination. 



BNSDOCID: <WO ^9914235A1J_> 



wo 99/14235 



PCT/US98/19163 



120 

As Shown in Figure 23, a band of -the correcl: size 
(160 bp) was seen in the kidney sample. At higher cycle 
numbers a persephin band was also seen in brain. Thus, 
the distribution of expression of persephin in various 
5 mouse tissues differs from that of neurturin in rat 
( Example 8 ) . 

Deposit of Strain , The following strain is on deposit 
under the terms of the Budapest Treaty, with the American 
Type Culture Collection, 12301 Parklawn Drive, Rockville, 

10 MD* The accession number indicated was assigned after 

successful viability testing, and the requisite fees were 
paid. Access to said cultures will be available during 
pendency of the patent application to one determined by 
the Commissioner to be entitled thereto under 37 CFR 1.14 

15 and 35 USC 122. All restriction on availability of said 
cultures to the public will be irrevocably removed upon 
the granting of a patent based upon the application. 
Moreover, the designated deposits will be maintained for 
a period of thirty (30) years from the date of deposit, 

20 or for five ( 5 ) years after the last request for the 

deposit, or for the enforceable life of the U.S. patent, 
whichever is longer. Should a culture become nonviable 
or be inadvertently destroyed, or, in the case of 
plasmid-containing strains, lose its plasmid, it will be 

25 replaced with a viable culture. The deposited materials 
mentioned herein are intended for convenience only, and 
are not required to practice the present invention in 
view of the description herein, and in addition, these 
materials are incorporated herein by reference . 



strain 


Deposit Date 


ATCC No. 


DG44CHO-PHSP-NGFI-B 


August 25, 1995 


CRL 11977 
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In view of t:he above, it will be seen that the 
several advantages of the invention are achieved and 
other advantageous results attained. 

As various changes could be made in the above 
5 methods and compositions without departing from the scope 
of the invention, it is intended that all matter 
contained in the above description and shown in the 
accompanying drawings shall be interpreted as 
illustrative and not in a limiting sense. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: JOHNSON, EUGENE M 

MILBRANDT, JEFFREY D 
KOTZBAUER, PAUL T 
LAMPE, PATRICIA A 
KLEIN, ROBERT 
DESAUVAGE , FRED 

(ii) TITLE OF INVENTION: PERSEPHIN AND RELATED GRO\mi FACTOR 
(iii) NUMBER OF SEQUENCES: 242 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: HOWELL & HAFERKAMP, L.C. 

(B) STREET: 7733 FORSYTH BOULEVARD, SUITE 1400 

(C) CITY: ST. LOUIS 

(D) STATE: MO 

(E) COUNTRY: USA 

(F) ZIP: 63105 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: HOLLAND, DONALD R. 

(B) REGISTRATION NUMBER: 35,197 

<C) REFERENCE/DOCKET NUMBER: 971486 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 314-727-5188 

(B) TELEFAX: 314-727-6092 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Ala Arg Leu Gly Ala Arg Pro Cys Gly Leu Arg Glu Leu Glu Val Arg 
15 10 15 

Val Ser Glu Leu Gly Leu Gly Tyr Ala Ser Asp Glu Thr Val Leu Phe 
20 25 30 

Arg Tyr Cys Ala Gly Ala Cys Glu Ala Ala Ala Arg Val Tyr Asp Leu 
35 40 45 

Gly Leu Arg Arg Leu Arg Gin Arg Arg Arg Leu Arg Arg Glu Arg Val 
50 55 60 



BNSDOCrD: <WO_9914235A1J_> 



wo 99/14235 



PCT/US98/19163 



123 

Arg Ala Gin Pro Cys Cys Arg Pro Thr Ala Tyr Glu Asp Glu Val Ser 
65 70 75 80 

Phe Leu Asp Ala His Ser Arg Tyr His Thr Val His Glu Leu Ser Ala 
85 90 95 

Arg Glu Cys Ala Cys Val 
100 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Pro Gly Ala Arg Pro Cys Gly Leu Arg Glu Leu Glu Val Arg Val Ser 
15 10 15 

Glu Leu Gly Leu Gly Tyr Thr Ser Asp Glu Thr Val Leu Phe Arg Tyr 

20 25 30 

Cys Ala Gly Ala Cys Glu Ala Ala lie Arg lie Tyr Asp Leu Gly Leu 
35 40 45 

Arg Arg Leu Arg Gin Arg Arg Arg Val Arg Arg Glu Arg Ala Arg Ala 
50 55 60 

His Pro Cys Cys Arg Pro Thr Ala Tyr Glu Asp Glu Val Ser Phe Leu 
65 70 75 80 

Asp Val His Ser Arg Tyr His Thr Leu Gin Glu Leu Ser Ala Arg Glu 
85 90 95 

Cys Ala Cys Val 
100 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note= "ANY AMINO ACID" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ser Gly Ala Arg Pro Xaa Gly Leu Arg Glu Leu Glu Val Ser Val Ser 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 
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(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



( ix) FEATURE : 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "ANY AMINO ACID" 

( ix) FEATURE : 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note= "SERINE OR CYSTEINE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Xaa Cys Ala Gly Ala Xaa Glu Ala Ala Val 
15 10 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRi\NDEDNESS : 

( D ) TO POLOG Y : linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "ANY AMINO ACID" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "ANY AMINO ACID" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 17 

(D) OTHER INFORMATION: /note= "GLUTAMINE OR GLUTAMIC ACID" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Xaa Xaa Val Glu Ala Lys Pro Cys Cys Gly Pro Thr Ala Tyr Glu Asp 
15 10 15 

Xaa Val Ser Phe Leu Ser Val 

20 



(2) INFORMATION FOR SEQ ID NO : 6 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Tyr His Thr Leu Gin Glu Leu Ser Ala Arg 
15 10 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Gin Arg Trp Lys Ala Ala Ala Leu Ala Ser Val Leu Cys Ser Ser 
15 10 15 

Val Leu Ser lie Trp Met Cys Arg Glu Gly Leu Leu Leu Ser His Arg 

20 25 30 

Leu Gly Pro Ala Leu Val Pro Leu His Arg Leu Pro Arg Thr Leu Asp 
35 40 45 

Ala Arg lie Ala Arg Leu Ala Gin Tyr Arg Ala Leu Leu Gin Gly Ala 

50 55 60 

Pro Asp Ala Met Glu Leu Arg Glu Leu Thr Pro Trp Ala Gly Arg Pro 
65 70 75 80 

Pro Gly Pro Arg Arg Arg Ala Gly Pro Arg Arg Arg Arg Ala Arg Ala 
85 90 95 

Arg Leu Gly Ala Arg Pro Cys Gly Leu Arg Glu Leu Glu Val Arg Val 
100 105 110 

Ser Glu Leu Gly Leu Gly Tyr Ala Ser Asp Glu Thr Val Leu Phe Arg 
115 120 125 

Tyr Cys Ala Gly Ala Cys Glu Ala Ala Ala Arg Val Tyr Asp Leu Gly 
130 135 140 

Leu Arg Arg Leu Arg Gin Arg Arg Arg Leu Arg Arg Glu Arg Val Arg 
145 150 155 160 

Ala Gin Pro Cys Cys Arg Pro Thr Ala Tyr Glu Asp Glu Val Ser Phe 
165 170 175 

Leu Asp Ala His Ser Arg Tyr His Thr Val His Glu Leu Ser Ala Arg 
180 185 190 

Glu Cys Ala Cys Val 

195 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Arg Arg Trp hys Ala Ala Ala Leu Val Ser Leu lie Cys Ser Ser 
15 10 15 

Leu Leu Ser Val Trp Met Cys Gin Glu Gly Leu Leu Leu Gly His Arg 
20 25 30 

Leu Gly Pro Ala Leu Ala Pro Leu Arg Arg Pro Pro Arg Thr Leu Asp 
35 40 45 

Ala Arg lie Ala Arg Leu Ala Gin Tyr Arg Ala Leu Leu Gin Gly Ala 

50 55 60 

Pro Asp Ala Val Glu Leu Arg Glu Leu Ser Pro Trp Ala Ala Arg lie 
65 70 75 80 

Pro Gly Pro Arg Arg Arg Ala Gly Pro Arg Arg Arg Arg Ala Arg Pro 
85 90 95 

Gly Ala Arg Pro Cys Gly Leu Arg Glu Leu Glu Val Arg Val Ser Glu 
100 105 110 

Leu Gly Leu Gly Tyr Thr Ser Asp Glu Thr Val Leu Phe Arg Tyr Cys 
115 120 125 

Ala Gly Ala Cys Glu Ala Ala lie Arg lie Tyr Asp Leu Gly Leu Arg 
130 135 140 

Arg Leu Arg Gin Arg Arg Arg Val Arg Arg Glu Arg Ala Arg Ala His 
145 150 155 160 

Pro Cys Cys Arg Pro Thr Ala Tyr Glu Asp Glu Val Ser Phe Leu Asp 
165 170 175 

Val His Ser Arg Tyr His Thr Leu Gin Glu Leu Ser Ala Arg Glu Cys 
180 185 190 

Ala Cys Val 
195 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 06 base pairs 

(B) TYPE: nucleic acid 

( C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii> MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GCGCGGTTGG GGGCGCGGCC TTGCGGGCTG CGCGAGCTGG AGGTGCGCGT GAGCGAGCTG 60 

GGCCTGGGCT ACGCGTCCGA CGAGACGGTG CTGTTCCGCT ACTGCGCAGG CGCCTGCQAG 120 

GCTGCCGCGC GCGTCTACGA CCTCGGGCTG CGACGACTGC GCCAGCGGCG GCGCCTGCGG 180 

CGGGAQCGGG TGCGCGCGCA GCCCTGCTGC CQCCCGACGG CCTACGAGGA CGAGGTGTCC 240 

TTCCTGGACG CGCACAGCCG CTACCACACG GTGCACGAGC TGTCGGCGCG CGAQTGCGCC 300 

TGCGTG 306 

(2) INFORMATION FOR SEQ ID NO : 10 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CCGGGGGCTC GGCCTTGTGG GCTGCGCGAG CTCGAGGTGC GCGTGAGCGA GCTGGGCCTG 60 

GGCTACACGT CGGATGAGAC CGTGCTGTTC CGCTACTGCG CAGGCGCGTG CGAGGCGGCC 120 

ATCCGCATCT ACGACCTGGG CCTTCGGCGC CTGCGCCAGC GGAGGCGCGT GCGCAGAGAG 180 

CGGGCGCGGG CGCACCCGTG TTGTCGCCCG ACGGCCTATG AGGACGAGGT GTCCTTCCTG 240 

GACGTGCACA GCCGCTACCA CACGCTGCAA GAGCTGTCGG CGCGGGAGTG CGCGTGCGTG 300 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 591 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 11 : 

ATGCAGCGCT GGAAGGCGGC GGCCTTGGCC TCAGTGCTCT GCAGCTCCGT GCTGTCCATC 60 

TGGATGTGTC GAGAGGGCCT GCTTCTCAGC CACCGCCTCG GACCTGCGCT GGTCCCCCTG 120 

CACCGCCTGC CTCGAACCCT GGACGCCCGG ATTGCCCGCC TGGCCCAGTA CCGTGCACTC 180 

CTGCAGGGGG CCCCGOATGC GATGGAGCTG CGCGAGCTGA CGCCCTGGGC TGGGCGGCCC 240 

CCAGGTCCGC GCCGTCGGQC GGGGCCCCGG CGGCGGCGCG CGCGTGCGCG GTTGGGGGCG 300 

CGGCCTTGCG GGCTGCGCGA GCTGGAGGTG CGCGTGAGCG AGCTGGGCCT GGGCTACGCG 3 60 

TCCGACGAGA CGGTGCTGTT CCGCTACTGC GCAGGCGCCT GCGAGGCTGC CGCGCGCGTC 420 

TACGACCTCG GGCTGCGACG ACTGCGCCAG CGGCGGCGCC TQCGGCGGGA GCGGGTGCGC 480 

GCGCAGCCCT GCTGCCGCCC GACGGCCTAC GAGGACGAGG TGTCCTTCCT GGACGCGCAC 540 

AGCCGCTACC ACACGGTGCA CQAGCTGTCG GCGCGCGAGT GCGCCTGCGT G 591 
(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
ATGAGGCGCT GGAAGGCAGC GGCCCTGGTG TCGCTCATCT GCAGCTCCCT GCTATCTGTC 60 
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TGGATGTGCC AGGAGGGTCT GCTCTTGGGC CACCGCCTGG GACCCGCGCT TGCCCCGCTA 



120 



CGACGCCCTC CACGCACCCT GGACGCCCGC ATCGCCCGCC TGGCCCAGTA TCGCGCTCTG 



180 



cfcCAGGGCG CCCCCGACGC GGTGGAGCTT CGAGAACTTT CTCCCTGGGC TGCCCGCATC 



240 



CCGGGACCGC GCCGTCGAGC GGGTCCCCGG CGTCGGCGGG CGCGGCCGGG GGCTCGGCCT 



300 



TGTGGGCTGC GCGAGCTCGA GGTGCGCGTG AGCGAGCTGG GCCTGGGCTA CACGTCGGAT 



360 



GAGACCGTGC TGTTCCGCTA CTGCGCAGGC GCGTGCGAGG CGGCCATCCG CATCTACGAC 



420 



CTGGGCCTTC GGCGCCTGCG CCAGCGGAGG CGCGTGCGCA GAGAGCGGGC GCGGGCGCAC 



480 



CCGTGTTGTC GCCCGACGGC CTATGAGGAC GAGGTGTCCT TCCTGGACGT GCACAGCCGC 



540 



TACCACACGC TGCAAGAGCT GTCGGCGCGG GAGTGCGCGT GCGTG 



585 



(2) INFORMATION FOR SEQ ID NO : 13 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 348 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GGAGGGAGAG CGCGCGGTGG TTTCGTCCGT GTGCCCCGCG CCCGGCGCTC CTCGCGTGGC 60 

CCCGCGTCCT GAGCGCGCTC CAGCCTCCCA CGCGCGCCAC CCCGGGGTTC ACTGAGCCCG 120 

GCGAGCCCGG GGAAGACAGA GAAAGAGAGG CCAGGGGGGG AACCCCATGG CCCGGCCCGT 180 

GTCCCGCACC CTGTGCGGTG GCCTCCTCCG GCACGGGGTC CCCGGGTCGC CTCCGGTCCC 240 

CGCGATCCGG ATGGCGCACG CAGTGGCTGQ GGCCGGGCCG GGCTCGGGTG GTCGGAGGAG 300 

TCACCACTGA CCGGGTCATC TGGAGCCCGT GGCAGGCCGA GGCCCAGG 348 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 
TGCTACCTCA CGCCCCCCGA CCTGCGAAAG GGCCCTCCCT GCCGACCCTC GCTGAGAACT 6 0 

GACTTCACAT AAAGTGTGGG AACTCCC 87 
(2) INFORMATION FOR SEQ ID NO:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Gin Arg Trp Lys Ala Ala Ala Leu Ala Ser Val Leu Cys Ser Ser 
1 5 10 15 

Val Leu Ser 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

( C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Arg Arg Trp Lys Ala Ala Ala Leu Val Ser Leu lie Cys Ser Ser 
15 10 15 

Leu Leu Ser 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 17 : 
ATGCAGCGCT GGAAGGCGGC GGCCTTGGCC TCAGTGCTCT GCAGCTCCGT GCTGTCC 57 
(2) INFORMATION FOR SEQ ID NO : 18 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
ATGAGGCGCT GGAAGGCAGC GGCCCTGGTG TCGCTCATCT GCAGCTCCCT QCTATCT 57 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 

lie Trp Met Cys Arg Glu Gly Leu Leu Leu Ser His Arg Leu Gly Pro 
15 10 15 

Ala Leu Val Pro Leu His Arg Leu Pro Arg Thr Leu Asp Ala Arg lie 
20 25 30 

Ala Arg Leu Ala Gin Tyr Arg Ala Leu Leu Gin Gly Ala Pro Asp Ala 
35 40 45 

Met Glu Leu Arg Glu Leu Thr Pro Trp Ala Gly Arg Pro Pro Gly Pro 
50 55 60 

Arg Arg Arg Ala Gly Pro Arg Arg Arg Arg Ala Arg 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATCTGGATGT GTCGAGAGGG CCTGCTTCTC AGCCACCGCC TCGGACCTGC GCTGGTCCCC 60 

CTGCACCGCC TGCCTCGAAC CCTGGACGCC CGGATTGCCC GCCTGGCCCA GTACCGTGCA 12 0 

CTCCTGCAGG GGGCCCCGGA TGCGATGGAG CTGCGCGAGC TGACGCCCTG GGCTGGGCGG 180 

CCCCCAGGTC CGCGCCGTCG GGCGGGGCCC CGGCGGCGGC GCGCGCGT 22 8 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GTCTGGATGT GCCAGGAGGG TCTGCTCTTG GGCCACCGCC TGGGACCCGC GCTTGCCCCG 60 

CTACGACGCC CTCCACGCAC CCTGGACGCC CGCATCGCCC GCCTGGCCCA GTATCGCGCT 120 

CTGCTCCAGG GCGCCCCCGA CGCGGTGGAG CTTCGAGAAC TTTCTCCCTG GGCTGCCCGC 180 

ATCCCGGGAC CGCGCCGTCG AGCGGGTCCC CGGCGTCGGC QQGCQCQQ 228 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Val Trp Met Cys Gin Glu Gly Leu Leu Leu Gly His Arg Leu Gly Pro 
15 10 15 

Ala Leu Ala Pro Leu Arg Arg Pro Pro Arg Thr Leu Asp Ala Arg He 

20 25 30 

Ala Arg Leu Ala Gin Tyr Arg Ala Leu Leu Gin Gly Ala Pro Asp Ala 
35 40 45 

Val Glu Leu Arg Glu Leu Ser Pro Trp Ala Ala Arg He Pro Gly Pro 
50 55 60 

Arg Arg Arg Ala Gly Pro Arg Arg Arg Arg Ala Arg 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Met Gin Arg Trp Lys Ala Ala Ala Leu Ala Ser Val Leu Cys Ser Ser 
15 10 15 

Val Leu Ser He Trp Met Cys Arg Glu Gly Leu Leu Leu Ser His Arg 
20 25 30 

Leu Gly Pro Ala Leu Val Pro Leu His Arg Leu Pro Arg Thr Leu Asp 
35 40 45 

Ala Arg He Ala Arg Leu Ala Gin Tyr Arg Ala Leu Leu Gin Gly Ala 
50 55 60 

Pro Asp Ala Met Glu Leu Arg Glu Leu Thr Pro Trp Ala Gly Arg Pro 
65 70 75 80 

Pro Gly Pro Arg Arg Arg Ala Gly Pro Arg Arg Arg Arg Ala Arg 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 
( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Arg Arg Trp Lys Ala Ala Ala Leu Val Ser Leu He Cys Ser Ser 
15 10 15 
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Leu Leu Ser Val Trp Met Cys Gin Glu Gly Leu Leu Leu Gly His Arg 
20 25 30 

Leu Gly Pro Ala Leu Ala Pro Leu Arg Arg Pro Pro Arg Thr Leu Asp 
35 40 45 

Ala Arg lie Ala Arg Leu Ala Gin Tyr Arg Ala Leu Leu Gin Gly Ala 
50 55 60 

Pro Asp Ala Val Glu Leu Arg Glu Leu Ser Pro Trp Ala Ala Arg lie 

65 70 75 80 

Pro Gly Pro Arg Arg Arg Ala Gly Pro Arg Arg Arg Arg Ala Arg 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

ATGCAGCGCT GGAAGGCGGC GGCCTTGGCC TCAGTGCTCT GCAGCTCCGT GCTGTCCATC 60 

TGGATGTGTC GAGAGGGCCT GCTTCTCAGC CACCGCCTCG GACCTGCGCT GGTCCCCCTG 12 0 

CACCGCCTGC CTCGAACCCT GGACGCCCGG ATTGCCCGCC TGGCCCAGTA CCGTGCACTC 180 

CTGCAGGGGG CCCCGGATGC GATGGAGCTG CGCGAGCTGA CGCCCTGGGC TGGGCGGCCC 24 0 

CCAGGTCCGC GCCGTCGGGC GGGGCCCCGG CGGCGGCGCG CGCGT 285 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

ATGAGGCGCT GGAAGGCAGC GGCCCTGGTG TCGCTCATCT GCAGCTCCCT GCTATCTGTC 60 

TGGATGTGCC AGGAGGGTCT GCTCTTGGGC CACCGCCTGG GACCCGCGCT TGCCCC6CTA 120 

CGACGCCCTC CACGCACCCT GGACGCCCGG ATCGCCCGCC TGGCCCAGTA TCGCGCTCTG 180 

CTCCAGGGCG CCCCCGACGC QGTGGAGCTT CGAGAACTTT CTCCCTGGGC TGCCCGCATC 240 

CCGGGACCGC GCCGTCGAGC GGGTCCCCGG CGTCGGCGGG CGCGG 285 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQITENCE CHARACTERISTICS: 

(A) LENGTH: 169 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
ATGCAGCGCT GGAAGGCGGC GGCCTTGGCC TCAGTGCTCT GCAGCTCCGT GCTGTCCATC 
TGGATGTGTC GAGAGGGCCT GCTTCTCAGC CACCGCCTCG GACCTGCGCT GGTCCCCCTG 
CACCGCCTGC CTCGAACCCT GGACGCCCGG ATTGCCCGCC TGGCCCAGT 
(2) INFORjyiATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
ACCGTGCACT CCTGCAGGGG GCCCCGGATG CGATGGAGCT GCGCGAGCTG ACGCCCTGGG 
CTGGGCGGCC CCCAGGTCCG CGCCGTCGGG CGGGGCCCCG GCGGCGGCGC GCGCGTGCGC 
GGTTGGGGGC GCGGCCTTGC GGGCTGCGCG AGCTGGAGGT GCGCGTGAGC GAGCTGGGCC 
TGGGCTACGC GTCCGACGAG ACGGTGCTGT TCCGCTACTG CGCAGGCGCC TGCOAGGCTG 
CCGCGCGCGT CTACGACCTC GGQCTGCGAC GACTGCGCCA GCGGCGGCGC CTGCGGCGGG 
AGCGGGTGCG CGCGCAGCCC TGCTGCCGCC CGACGGCCTA CGAGGACGAG GTGTCCTTCC 
TGGACGCGCA CAGCCGCTAC CACACGGTGC ACGAGCTGTC GGCGCGCGAG TGCGCCTGCG 
TGTGA 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 169 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
ATGAGGCGCT GGAAGGCAGC GGCCCTGGTG TCGCTCATCT GCAGCTCCGT GCTATCTGTC 60 
TGGATGTGCC AGGAGGGTCT GCTCTTGGGC CACCGCCTGG GACCCGCGCT TGCCCCGCTA 120 
CGACGCCCTC CACGCACCCT GGACGCCCGC ATCGCCCGCC TGGCCCAGT 169 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

ATCGCGCTCT GCTCCAGGGC GCCCCCGACG CGGTGGAGCT TCGAGAACTT TCTCCCTGGG 60 

CTGCCCGCAT CCCGGGACCG CGCCGTCGAG CGGGTCCCCG GCGTCGGCGG GCGCGGCCGG 120 

GGGCTCGGCC TTGTGGGCTG CGCGAGCTCG AGGTGCGCGT GAGCGAGCTG GGCCTGGGCT 180 

ACACGTCGGA TGAGACCGTG CTGTTCCGCT ACTGCGCAGG CGCGTGCGAG GCGGCCATCC 24 0 

GCATCTACGA CCTGGGCCTT CGGCGCCTGC GCCAGCGGAG GCGCGTGCGC AGAGAGCGGG 3 00 

CGCGGGCGCA CCCGTGTTGT CGCCCGACGG CCTATGAGGA CGAGGTGTCC TTCCTGGACG 3 60 

TGCACAGCCG CTACCACACG CTGCAAGAGC TGTCGGCGCG GGAGTGCGCG TGCGTGTGA 419 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 94 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Cys Gly Leu Arg Glu Leu Glu Val Arg Val Ser Glu Leu Gly Leu Gly 
15 10 15 

Tyr Ala Ser Asp Glu Thr Val Leu Phe Arg Tyr Cys Ala Gly Ala Cys 
20 25 30 

Glu Ala Ala Ala Arg Val Tyr Asp Leu Gly Leu Arg Arg Leu Arg Gin 

35 40 45 

Arg Arg Arg Leu Arg Arg Glu Arg Val Arg Ala Gin Pro Cys Cys Arg 
50 55 60 

Pro Thr Ala Tyr Glu Asp Glu Val Ser Phe Leu Asp Ala His Ser Arg 
65 70 75 80 

Tyr His Thr Val His Glu Leu Ser Ala Arg Glu Cys Ala Cys 
85 90 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Cys Gly Leu Arg Glu Leu Glu Val Arg Val Ser Glu Leu Gly Leu Gly 
15 10 15 

Tyr Thr Ser Asp Glu Thr Val Leu Phe Arg Tyr Cys Ala Gly Ala Cys 
20 25 30 
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Glu Ala Ala lie Arg He Tyr Asp 
35 40 

Arg Arg Arg Val Arg Arg Glu Arg 
50 55 

Pro Thr Ala Tyr Glu Asp Glu Val 
65 70 

Tyr His Thr Leu Gin Glu Leu Ser 

85 



Leu Gly Leu Arg Arg Leu Arg Gin 
45 

Ala Arg Ala His Pro Cys Cys Arg 
60 

Ser Phe Leu Asp Val His Ser Arg 
75 80 

Ala Arg Glu Cys Ala Cys 
90 



(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 8 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "SERINE OR THREONINE" 



(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "GLUTAMIC ACID OR ASPARTIC 

ACID" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

Val Xaa Xaa Leu Gly Leu Gly Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



( ix) FEATURE : 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "THREONINE OR GLUTAMIC ACID" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "VALINE OR LEUCINE" 



( ix) FEATURE : 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "LEUCINE OR ISOLEUCINE" 



( ix) FEATURE : 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /note= "ALANINE OR SERINE" 
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(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /note= "ALANINE OR SERINE" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /note= "GLUTAMIC ACID OR ASPARTIC 

ACID" 

(ix) FEATURE: 

(A) NAME/KEY: Modif led- s it e 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /note= "ALANINE OR SERINE" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Glu Xaa Xaa Xaa Phe Arg Tyr Cys Xaa Gly Xaa Cys Xaa Xaa Ala 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= "THREONINE OR VALINE OR 
ISOLEUCINE" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "TYROSINE OR PHENYLALANINE" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- sit e 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /note= "GLUTAMIC ACID OR ASPARTIC 

ACID" 

( ix) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /note= "GLUTAMIC ACID OR ASPARTIC 

ACID" 

( ix) FEATURE : 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /note= "VALINE OR LEUCINE" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Cys Cys Azrg Pro Xaa Ala Xaa Xaa Asp Xaa Xaa Ser Phe Leu Asp 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= "ALANINE OR SERINE" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "ALANINE OR SERINE" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /note= "GLtJTAMIC ACID OR ASPARTIC 

ACID" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /note= "SERINE OR ALANINE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 6 : 

Phe Arg Tyr Cys Xaa Gly Xaa Cys Xaa Xaa Ala 
15 10 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NTUyiE/KEY: Modif ied- site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= "ALANINE OR SERINE" 



( ix) FEATURE : 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "ALANINE OR SERINE" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /note= "GLtTTAMIC ACID OR ASPARTIC 

ACID" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /note= "SERINE OR ALANINE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

Phe Arg Tyr Cys Xaa Gly Xaa Cys Xaa Xaa Ala 
15 10 
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(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



( ix) FEATURE : 

(A) NAME/KEY: Modi fied- site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= "ISOLEUCINE OR THREONINE OR 

VALINE" 



( ix) FEATURE : 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "TYROSINE OR PHENYLALANINE" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /note= "GLUTAMIC ACID OR ASPARTIC 

ACID" 



( ix) FEATURE : 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /note= "GLUTAMIC ACID OR ASPARTIC 

ACID" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

Cys Cys Arg Pro Xaa Ala Xaa Xaa Asp Xaa 
15 10 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



( ix) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "TYROSINE OR PHENYIALANINE" 

( ix) FEATURE : 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "GLUTAMIC ACID OR ASPARTIC 

ACID" 

( ix) FEATURE : 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= "GLUTAMIC ACID OR ASPARTIC 

ACID" 



( ix) FEATURE : 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 6 
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(D) OTHER INFORMATION: /note= "VALINE OR LEUCINE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

Ala Xaa Xaa Asp Xaa Xaa Ser Phe Leu Asp 
15 10 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "GLUTAMIC ACID OR THREONINE" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "LEUCINE OR VALINE" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "ISOLEUCINE OR LEUCINE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Glu Xaa Xaa Xaa Phe Arg Tyr Cys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 13 amino acids 

(B) TYPE : amino acid 

(C) STIU^NDEDNESS : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "GLUTAMIC ACID OR THREONINE" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "LEUCINE OR VALINE" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "ISOLEUCINE OR LEUCINE" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /note= "SERINE OR ALANINE" 
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(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /note= "SERINE OR AIiANINE" 



(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /note= "GLUTAMIC ACID OR ASP7VRTIC 

ACID" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Glu Xaa Xaa Xaa Phe Arg Tyr Cys Xaa Gly Xaa Cys Xaa 
15 10 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
GTNWSNGANY TNGGNYTNGG NTA 23 
(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
TTYMGNTAYT GYDSNGGNDS NTGYGANKCN GC 32 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
GCNGMNTCRC ANSHNCCNSH RCARTANCKR AA 32 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingl e 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
TCRTCNTCRW ANGCNRYNGG NCKRCARCA 29 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
TCNARPAANS WNAVNTCRTC NTCRWANGC 29 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
GARRMNBTNH TNTTYMGNTA YTG 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
GARRMNBTNH TNTTYMGNTA YTGYDSNGGN DSNTGHGA 3 8 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

Ser Gly Ala Arg Pro Xaa Gly Leu Arg Glu Leu Glu Val Ser Val Ser 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
CCNACNGCNT AYGARGA 17 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Ala Arg Ala His Pro Cys Cys Arg Pro Thr Ala Tyr Glu Asp Glu Val 
15 10 15 

Ser Phe Leu Asp 
20 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
ARYTCYTGNA RNGTRTGRTA 20 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GACGAGGTGT CCTTCCTGGA CGTACACA 
(2) INFORMATION FOR SEQ ID NO : 54 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 54 : 
TAGCGGCTGT GTACGTCCAG GAAGGACACC TCGT 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CAGCGACGAC GCGTGCGCAA AGAGCG 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
TAYGARGACG AGGTGTCCTT CCTGOACGTA CACAGCCGCT AYCAYAC 47 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECXJLE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
GCGGCCATCC GCATCTACGA CCTGGG 26 
(2) INFORMATION FOR SEQ ID NO: 56: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDMA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
CRTAGGCCGT CGGGCGRCAR CACGGGT 27 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
GCGCCGAAGG CCCAGGTCGT AGATGCG 27 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
CGCTACTGCG CAGGCGCGTG CGARGCGGC 29 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
CGCCGACAGC TCTTGCAGCG TRTGGTA 27 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
GAGCTGGGCC TGGGCTACGC GTCCGACGAG 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARA.CTERI ST I CS : 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
GCGACGCGTA CCATGAGGCG CTGG/^GGCA GCGGCCCTG 39 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 64 : 
GACGGATCCG CATCACACGC ACGCGCACTC 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
GACCATATGC CGGGGGCTCG GCCTTGTGG 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
GACGGATCCG CATCACACGC ACGCGCACTC 30 
(2) INFORMATION FOR SEQ ID NO: 67: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
CAGCGACGAC GCGTGCGCAA AGAGCG 26 
(2) INFORMATION FOR SEQ ID NO : 68 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
TAGCGGCTGT GTACGTCCAG GAAGGACACC TCGT 34 
(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
AAAAATCGGG GGTGYGTCTT A 21 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
CATGCCTGGC CTACYTTGTC A 21 
(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
CTGGCGTCCC AMCAAGGGTC TTCG 24 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STFiANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
GCCAGTGGTG CCGTCGAGGC GGG 23 
(2) INFORMATION FOR SEQ ID NO : 73 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
GGCCCAGGAT GAGGCGCTGG AAGG 24 
(2) INFORMATION FOR SEQ ID NO : 74 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
CCACTCCACT GCCTGAWATT CWACCCC 27 
(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CCATGTGATT ATCGACCATT CGQC 24 
(2) INFORMATION FOR SEQ ID NO: 76: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNE S S : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Ser Pro Asp Lys Gin Met Ala Val Leu Pro Arg Arg Glu Arg Asn Arg 
15 10 15 

Gin Ala Ala Ala Ala Asn Pro Glu Asn Ser Arg Gly Lys Gly Arg Arg 
20 25 30 

Gly Gin Arg Gly Lys Asn Arg Gly Cys Val Leu Thr Ala lie His Leu 
35 40 45 

Asn Val Thr Asp Leu Gly Leu Gly Tyr Glu Thr Lys Glu Glu Leu lie 
50 55 60 

Phe Arg Tyr Cys Ser Gly Ser Cys Asp Ala Ala Glu Thr Thr Tyr Asp 
65 70 75 80 

Lys lie Leu Lys Asn Leu Ser Arg Asn Arg Arg Leu Val Ser Asp Lys 
85 90 95 

Val Gly Gin Ala Cys Cys Arg Pro lie Ala Phe Asp Asp Asp Leu Ser 
100 105 110 

Phe Leu Asp Asp Asn Leu Val Tyr His lie Leu Arg Lys His Ser Ala 
115 120 125 

Lys Arg Cys Gly Cys lie 
130 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHTUyVCTERISTICS : 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DE£ 

Ser Pro Asp Lys 
1 

Gin Ala Ala Ala 
20 

Gly Gin Arg Gly 
35 

Asn Val Thr Asp 
50 

Phe Arg Tyr Cys 
65 

Lys lie Leu Lys 



ICRIPTION: SEQ 11 

Gin Ala Ala Ala 

5 

Ala Ser Pro Glu 



Lys Asn Arg Gly 
40 

Leu Gly Leu Gly 
55 

Ser Gly Ser Cys 
70 

Asn Leu Ser Arg 
85 



► NO: 77: 

Leu Pro Arg Arg 
10 

Asn Ser Arg Gly 
25 

Cys Val Leu Thr 



Tyr Glu Thr Lys 
60 

Glu Ser Ala Glu 
75 



Ser Arg Arg Leu 
90 



Glu Arg Asn Arg 

15 

Lys Gly Arg Arg 
30 

Ala lie His Leu 
45 

Glu Glu Leu lie 



Thr Met Tyr Asp 
80 

Thr Ser Asp Lys 
95 
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Val Gly Gin Ala Cys Cys Arg Pro Val Ala Phe Asp Asp Asp Leu Ser 
100 105 110 

Phe Leu Asp Asp Asn Leu Val Tyr His lie Leu Arg Lys His Ser Ala 
115 120 125 

Lys Arg Cys Gly Cys lie 
130 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Ser Pro Asp Lys Gin Ala Ala Ala Leu Pro Arg Arg Glu Arg Asn Arg 
15 10 15 

Gin Ala Ala Ala Ala Ser Pro Glu Asn Ser Arg Gly Lys Gly Arg Arg 

20 25 30 

Gly Gin Arg Gly Lys Asn Arg Gly Cys Val Leu Thr Ala lie His Leu 
35 40 45 

Asn Val Thr Asp Leu Gly Leu Gly Tyr Glu Thr Lys Glu Glu Leu lie 

50 55 60 

Phe Arg Tyr Cys Ser Gly Ser Cys Glu Ala Ala Glu Thr Met Tyr Asp 
65 " 70 75 80 

Lys lie Leu Lys Asn Leu Ser Arg Ser Arg Arg Leu Thr Ser Asp Lys 
85 90 95 

Val Gly Gin Ala Cys Cys Arg Pro Val Ala Phe Asp Asp Asp Leu Ser 
100 105 110 

Phe Leu Asp Asp Ser Leu Val Tyr His lie Leu Arg Lys His Ser Ala 
115 120 125 

Lys Arg Cys Gly Cys lie 
130 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Cys Arg Leu Trp Ser Leu Thr Leu Pro Val Ala Glu Leu Gly Leu Gly 
15 10 15 

Tyr Ala Ser Glu Glu Lys Val lie Phe Arg Tyr Cys Ala Gly Ser Cys 
20 25 30 
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Pro Gin Glu Ala Arg Thr Gin His Ser Leu Val Leu Ala Arg Leu Arg 

35 40 45 

Gly Arg Gly Arg Ala His Gly Arg Pro Cys Cys Gin Pro Thr Ser Tyr 
50 55 60 

Ala Asp Val Thr Phe Leu Asp Asp Gin His His Trp Gin Gin Leu Pro 
65 70 75 80 

Gin Leu Ser Ala Ala Ala Cys Gly Cys 
85 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: BO: 

Ala Leu Ala Gly Ser Cys Arg Leu Trp Ser Leu Thr Leu Pro Val Ala 
15 10 15 

Glu Leu Gly Leu Gly Tyr Ala Ser Glu Glu Lys Val lie Phe Arg Tyr 
20 25 30 

Cys Ala Gly Ser Cys Pro Gin Glu Ala Arg Thr Gin His Ser Leu Val 
35 40 45 

Leu Ala Arg Leu Arg Gly Arg Gly Arg Ala His Gly Arg Pro Cys Cys 

50 55 60 

Gin Pro Thr Ser Tyr Ala Asp Val Thr Phe Leu Asp Asp Gin His His 
65 70 75 80 

Trp Gin Gin Leu Pro Gin Leu Ser Ala Ala Ala Cys Gly Cys Gly Gly 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Val Arg lie Pro Gly Gly Leu Pro Thr Pro Gin Phe Leu Leu Ser Lys 
15 10 15 

Pro Ser Leu Cys Leu Thr lie Leu Leu Tyr Leu Ala Leu Gly Asn Asn 
20 25 30 

His Val Arg Leu Pro Arg Ala Leu Ala Gly Ser Cys Arg Leu Trp Ser 
35 40 45 
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Leu Thr Leu Pro Val Ala Glu Leu Gly Leu Gly Tyr Ala Ser Glu Glu 
50 55 60 

Lys Val He Phe Arg Tyr Cys Ala Gly Ser Cys Pro Gin Glu Ala Arg 

65 70 75 80 

Thr Gin His Ser Leu Val Leu Ala Arg Leu Arg Gly Arg Gly Arg Ala 
85 90 95 

His Gly Arg Pro Cys Cys Gin Pro Thr Ser Tyr Ala Asp Val Thr Phe 
100 105 110 

Leu Asp Asp Gin His His Trp Gin Gin Leu Pro Gin Leu Ser Ala Ala 
115 120 125 

Ala Cys Gly Cys Gly Gly 
130 

(2) INFORMATION FOR SEQ ID NO : 82 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNE S S : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Cys Arg Leu Trp Ser Leu Thr Leu Pro Val Ala Glu Leu Gly Leu Gly 
15 10 15 

Tyr Ala Ser Glu Glu Lys He He Phe Arg Tyr Cys Ala Gly Ser Cys 
20 25 30 

Pro Gin Glu Val Arg Thr Gin His Ser Leu Val Leu Ala Arg Leu Arg 
35 40 45 

Gly Gin Gly Arg Ala His Gly Arg Pro Cys Cys Gin Pro Thr Ser Tyr 
50 55 60 

Ala Asp Val Thr Phe Leu Asp Asp His His His Trp Gin Gin Leu Pro 
65 70 75 80 

Gin Leu Ser Ala Ala Ala Cys Gly Cys 
85 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Cys Arg Leu Trp Ser Leu Thr Leu Pro Val Ala Glu Leu Gly Leu Gly 
15 10 15 

Tyr Ala Ser Glu Glu Lys He He Phe Arg Tyr Cys Ala Gly Ser Cys 
20 25 30 
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Pro Gin Glu Val Arg Thr Gin His Ser Leu Val Leu Ala Arg Leu Arg 
35 40 45 

Gly Gin Gly Arg Ala His Gly Arg Pro Cys Cys Gin Pro Thr Ser Tyr 
50 55 60 

Ala Asp Val Thr Phe Leu Asp Asp His His His Trp Gin Gin Leu Pro 
65 70 75 80 

Gin Leu Ser Ala Ala Ala Cys Gly Cys Gly Gly 
85 90 

(2) INFORMATION FOR SEQ ID NO : 84 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 84 : 

TGCCQACTGT GGAGCCTGAC CCTACCAGTG GCTGAGCTGG GCCTGGGCTA TGCCTCGGAG 60 

GAGAAGGTCA TCTTCCGATA CTGTGCTGGC AGCTGTCCCC AAGAGGCCCG TACCCAGCAC 120 

AGTCTGGTAC TGGCCCGGCT TCGAGGGCGG GGTCGAGCCC ATGGCCGACC CTGCTGCCAG 180 

CCCACCAGCT ATGCTGATGT GACCTTCCTT GATGATCAGC ACCATTGGCA GCAGCTGCCT 240 

CAGCTCTCAG CTGCAGCTTG TGGCTGT 2 67 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

TGCCGGCTGT GGAGCCTGAC CCTACCAGTG GCTGAGCTTG GCCTGGGCTA TGCCTCAGAG 60 

GAGAAGATTA TCTTCCGATA CTGTGCTGGC AGCTGTCCCC AAGAGGTCCG TACCCAGCAC 120 

AGTCTGGTGC TGGCCCGTCT TCGAGGGCAG GGTCGAGCTC ATGGCAGACC TTGCTGCCAG 180 

CCCACCAGCT ATGCTGATGT GACCTTCCTT GATGACCACC ACCATTGGCA GCAGCTGCCT 240 

CAGCTCTCAG CCGCAGCTTG TGGCTGT 2 67 
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

TGCCGGCTGT GGAGCCTGAC CCTACCAGTG GCTGAGCTTG GCCTGGGCTA TGCCTCAGAG 60 

GAGAAGATTA TCTTCCGATA CTGTGCTGGC AGCTGTCCCC AAGAGGTCCG TACCCAGCAC 120 

AGTCTGGTGC TGGCCCGTCT TCGAGGGCAG GGTCGAGCTC ATGGCAGACC TTGCTGCCAG 180 

CCCACCAGCT ATGCTGATGT GACCTTCCTT GATGACCACC ACCATTGGCA GCAGCTGCCT 240 

CAGCTCTCAG CCGCAGCTTG TGGCTGTGGT GGC 273 
(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE : amino acid 

( C ) STRANDEDNE S S : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Cys Val Leu Thr Ala lie His Leu Asn Val Thr Asp Leu Gly Leu Gly 
15 10 15 

Tyr Glu Thr Lys Glu Glu Leu lie Phe Arg Tyr Cys Ser Gly Ser Cys 
20 25 30 

Glu Ser Ala Glu Thr Met Tyr Asp Lys lie Leu Lys Asn Leu Ser Arg 
35 40 45 

Ser Arg Arg Leu Thr Ser Asp Lys Val Gly Gin Ala Cys Cys Arg Pro 

50 55 60 

Val Ala Phe Asp Asp Asp Leu Ser Phe Leu Asp Asp Asn Leu Val Tyr 
65 70 75 80 

His lie Leu Arg Lys His Ser Ala Lys Arg Cys Gly Cys lie 
85 90 



(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Cys Gly Leu Arg Glu Leu Glu Val Arg Val Ser Glu Leu Gly Leu Gly 
15 10 15 

Tyr Thr Ser Asp Glu Thr Val Leu Phe Arg Tyr Cys Ala Gly Ala Cys 
20 25 30 

Glu Ala Ala lie Arg lie Tyr Asp Leu Gly Leu Arg Arg Leu Arg Gin 
35 40 45 
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Arg Arg Arg Val Arg Arg Glu Arg Ala Arg Ala His Pro Cys Cys Arg 
50 55 60 

Pro Thr Ala Tyr Glu Asp Glu Val Ser Phe Leu Asp Val His Ser Arg 
65 70 75 80 

Tyr His Thr Leu Gin Glu Leu Ser Ala Arg Glu Cys Ala Cys Val 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Cys Arg Leu Trp Ser Leu Thr Leu Pro Val Ala Glu Leu Gly Leu Gly 
15 10 15 

Tyr Ala Ser Glu Glu Lys Val lie Phe Arg Tyr Cys Ala Gly Ser Cys 
20 25 30 

Pro Gin Glu Ala Arg Thr Gin His Ser Leu Val Leu Ala Arg Leu Arg 
35 40 45 

Gly Arg Gly Arg Ala His Gly Arg Pro Cys Cys Gin Pro Thr Ser Tyr 
50 55 60 

Ala Asp Val Thr Phe Leu Asp Asp Gin His His Trp Gin Gin Leu Pro 
65 70 75 80 

Gin Leu Ser Ala Ala Ala Cys Gly Cys Gly Gly 
85 90 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
TGCCTCAGAG GAGAAGATTA TC 22 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE : amino acid 

( C ) STRANDEDNES S : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Ala Ser Glu Glu Lys lie lie 
1 5 

(2) INFORMATION FOR SEQ ID NO : 92 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

( C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Leu Gly Leu Gly Tyr Glu Thr Lys Glu Glu Leu lie Phe Arg Tyr Cys 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 

Leu Gly Leu Gly Tyr Thr Ser Asp Glu Thr Val Leu Phe Arg Tyr Cys 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

Leu Gly Leu Gly Tyr Ala Ser Glu Glu Lys lie lie Phe Arg Tyr Cys 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:95: 
AGTCGGGGTT GGGGTATGCC TCA 23 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLiOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
TATGCCTCAG AGGAGAAGAT TATCTT 2 6 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECXHiE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

CCTCAGAGGA GAAGATTATC TTCCGATACT GTGCTGGCAG CTGTCCCCAA GAGGTCCGTA 60 

CCCAGCACAG TCTGGTGCTG GCCCGTCTTC GAGGGCAGGG TCGAGCTCAT GGCAGACCTT 120 

GCTGCCAGCC CACCAGCTAT GCTGATGTGA CCTTCCTTGA TGACCACCAC CATTGGCAGC 180 

AGCTGCCTCA GCTCTCAGCC GCAGCTTGTG GCTGTGGTGG CTGAAGGCGG CCAGCCTGGT 240 

CTCTCAGAAT CACAAGCAAG AGGCAGCCTT TGAAAGGCTC AGGTGACGTT ATTAGAAACT 300 

TGCATAGGAG AAGATTAAGA AGAGAAAGGG GACCTG 336 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

Ala Cys Cys Arg Pro Val Ala Phe Asp Asp Asp Leu Ser Phe Leu Asp 
15 10 15 

Asp 
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(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

Pro Cys Cys Arg Pro Thr Ala Tyr Glu Asp Glu Val Ser Phe Lys Asp 
15 10 15 

Val 



(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE : amino acid 

( C ) STRANDEDNESS : 

( D ) TO POLOG Y : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

Pro Cys Cys Gin Pro Thr Ser Tyr Ala Asp Val Thr Phe Leu Asp Asp 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
TCATCAAGGA AGGTCACATC AGCATA 
(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
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CCACCACAGC CACAAGCTGC GGSTGAGAGC TG 32 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

Ala Leu Ala Gly Ser 

1 5 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: 

Val Arg lie Pro Gly Gly Leu Pro Thr Pro Gin Phe Leu Leu Ser Lys 
15 10 15 

Pro Ser Leu Cys Leu Thr lie Leu Leu Tyr Leu Ala Leu Gly Asn Asn 
20 25 30 

His Val Arg Leu Pro Arg Ala Leu Ala Gly Ser 
35 40 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 



GAOGGACCTG 


GACGCCCCAT 


CAGGGTAAGA 


ATTCCTGGGG 


GCCTCCCGAC 


TCCCCAATTC 


60 


CTTCTCTCAA 


AGCCCTCACT 


TTGCCTTACA 


ATCCTACTCT 


ACCTTGCACT 


AGGTAACAAC 


120 


CATGTCCGTC 


TTCCAAGAGC 


CTTGGCTGGT 


TCATGCCGAC 


TGTGGA6CCT 


OACCCTACCA 


180 


GTGGCTGAGC 


TGGGCCTGGG 


CTATGCCTCG 


QAGGAQAAGG 


TCATCTTCCG 


ATACTGTGCT 


240 


GGCAGCTGTC 


CCCAAQAGGC 


CCGTACCCAG 


CACAGTCTGG 


TACTGGCCCG 


GCTTCOAGGG 


300 


CGGGGTCGAG 


CCCATGGCCG 


ACCCTGCTGC 


CAGCCCACCA 


GCTATGCTGA 


TGTGACCTTC 


360 
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CTTGATGATC AGCACCATTG GCAGCAGCTG CCTCAGCTCT CAGCTGCAGC TTGTGGCTGT 420 

GGTGGCTGAA GGAGGCCAGT CTGGTGTCTC AGAATCACAA GCATGAGACA GGCTGGGCTT 48 0 

TGAAAGGCTC AGGTGACATT ACTAGAAATT TGCATAGGTA AAGATAAGAA QGGAAAGGAC 540 

CAGG 544 



(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:106: 



CCTCAGAGGA 


GAAGATTATC 


TTCCGATACT 


GTGCTGGCAG 


CTGTCCCCAA 


GAGGTCCGTA 


60 


CCCAGCACAG 


TCTGGTGCTG 


GCCCGTCTTC 


GAGGGCAGGG 


TCGAGCTCAT 


GGCAGACCTT 


12 0 


GCTGCCAGCC 


CACCAGCTAT 


GCTGATGTGA 


CCTTCCTTGA 


TGACCACCAC 


CATTGGCAGC 


180 


AGCTGCCTCA 


GCTCTCAGCC 


GCAGCTTGTG 


GCTGTGGTGG 


CTGAAGGCGG 


CCAGCCTGGT 


240 


CTCTCAGAAT 


CACAAGCAAG 


AGGCAGCCTT 


TGAAAGGCTC 


AGGTGACGTT 


ATTAGAAACT 


300 


TGCATAGGAG 


AAGATTAAGA 


AGAGAAAGGG 


GACCTG 






336 



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 391 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 



TGCCGGCTGT 


GGAGCCTGAC 


CCTACCAGTG 


GCTGAGCTTG 


GCCTG6GCTA 


TGCCTCAGAG 


60 


GAGAAGATTA 


TCTTCCGATA 


CTGTGCTGGC 


AGCTGTCCCC 


AAGAGGTCCG 


TACCCAGCAC 


120 


AGTCTGGTGC 


TGGCCCGTCT 


TCGAGGGCAG 


GGTCGAGCTC 


ATGGCAGACC 


TTGCTGCCAQ 


180 


CCCACCAGCT 


ATGCTGATGT 


GACCTTCCTT 


GATGACCACC 


ACCATTGGCA 


GCAGCTGCCT 


240 


CAGCTCTCAG 


CCGCAGCTTG 


TGGCTGTGGT 


GGCTGAAGGC 


GGCCAGCCTG 


GTCTCTCAGA 


300 


ATCACAAGCA 


AGAGGCAGCC 


TTTGAAAGGC 


TCAGGTGACG 


TTATTAGAAA 


CTTGCATAGG 


360 


AGAAGATTAA 


GAAGAGAAAG 


GGGACCTGAT 


T 






391 



(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "SERINE, THREONINE, OR 

ALANINE" 



( ix) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "GLUTAMIC ACID OR ASPARTIC 

ACID" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

Val Xaa Xaa Leu Gly Leu Gly Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= "ALANINE OR SERINE" 



( ix) FEATURE : 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "ALANINE OR SERINE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

Phe Arg Tyr Cys Xaa Gly Xaa Cys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "ASPARTIC ACID, GLUTAMIC 
ACID OR NO AMINO ACID" 



( ix) FEATURE : 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "VALINE OR LEUCINE" 



( ix) FEATURE : 
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(A) NAME/KEY: Modified- site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "SERINE OR THREONINE" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /note= "valine or aspartic acid" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

Asp Xaa Xaa Xaa Phe Leu Asp Xaa 
1 5 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 111 : 

Glu Gly Pro Gly Arg Pro lie Arg Val Arg lie Pro Gly Gly Leu Pro 
15 10 15 

Thr Pro Gin Phe Leu Leu Ser Lys Pro Ser Leu Cys Leu Thr He Leu 
20 25 30 

Leu Tyr Leu Ala Leu Gly Asn Asn His Val Arg Leu Pro Arg Ala Leu 
35 40 45 

Ala Gly Ser Cys Arg Leu Trp Ser Leu Thr Leu Pro Val Ala Glu Leu 
50 55 60 

Gly Leu Gly Tyr Ala Ser Glu Glu Lys Val He Phe Arg Tyr Cys Ala 
65 70 75 80 

Gly Ser Cys Pro Gin Glu Ala Arg Thr Gin His Ser Leu Val Leu Ala 
85 90 95 

Arg Leu Arg Gly Arg Gly Arg Ala His Gly Arg Pro Cys Cys Gin Pro 
100 105 110 

Thr Ser Tyr Ala Asp Val Thr Phe Leu Asp Asp Gin His His Trp Gin 

115 120 125 

Gin Leu Pro Gin Leu Ser Ala Ala Ala Cys Gly Cys Gly Gly 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

Ala Leu Pro Gly Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "THREONINE, GLUTAMIC ACID OR 

LYSINE" 



(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "VALINE, LEUCINE OR 
ISOLEUCINE" 



( ix) FEATURE : 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "LEUCINE OR ISOLEUCINE" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /note= "ALANINE OR SERINE" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /note= "ALANINE OR SERINE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

Glu Xaa Xaa Xaa Phe Arg Tyr Cys Xaa Gly Xaa Cys 
15 10 

(2) INFORMATION FOR SEQ ID NO : 114 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECXJLE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "ARGININE OR GLUTAMINE" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= "THREONINE, VALINE OR 
ISOLEUCINE" 
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(ix) FEATURE: 

(A) NAME/KEY: Modif led- site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note= "ALANINE OR SERINE" 



(ix) FEATURE: 

(A) NAME/KEY: Modif led- site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "TYROSINE OR PHENYLALANINE" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /note= "GLUTAMIC ACID, ASPARTIC 
ACID OR ALANINE" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /note= "GLUTAMIC ACID, ASPARTIC 
ACID OR NO AMINO ACID" 



( ix) FEATURE : 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /note= "VALINE OR LEUCINE" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /note= "SERINE OR THREONINE" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 16 

(D) OTHER INFORMATION: /note= "ASPARTIC ACID OR VALINE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

Cys Cys Xaa Pro Xaa Xaa Xaa Xaa Asp Xaa Xaa Xaa Phe Leu Asp Xaa 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
GTNDGNGANY TGGGNYTGGG NTA 
(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

( C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
GANBTNWCNT TYYTNGANG 19 
(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
GANBTNWCNT TYYTNGANGW 20 
(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
TTYMGNTAYT GYDSNGGNDS NTG 23 
(2) INFORMATION FOR SEQ ID NO : 119 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
GTNDGNGANY TGGGNYTNGG 20 
(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 
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GTNDGNGANY TGGGNYTGGG NTT 
(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 
WCNTCNARRA ANGWNAVNTC 
(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 122 : 
WCNTCNARRA ANGWNAVNT 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 123 : 
CANSHNCCNS HRCARTANCK R7^ 23 
(2) INFORMATION FOR SEQ ID NO : 124 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 
CANSHNCCNS HRCARTANCK RAANA 25 
(2) INFORMATION FOR SEQ ID NO: 12 5: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "THREONINE, SERINE OR 

ALANINE" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "GLUTAMIC ACID OR ASPARTIC 

ACID" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

Val Xaa Xaa Leu Gly Leu Gly Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "ASPARTIC ACID OR GLUTAMIC 

ACID" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "VALINE OR LEUCINE" 

( ix) FEATURE : 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /notes= "THREONINE OR SERINE" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note= "ASPARTIC ACID OR GLUTAMIC 

ACID" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "ASPARTIC ACID OR VALINE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

Xaa Xaa Xaa Phe Leu Xaa Xaa 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= "SERINE OR ALANINE" 



( ix) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "SERINE OR ALANINE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

Phe Arg Tyr Cys Xaa Gly Xaa Cys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "THREONINE, SERINE OR 

ALANINE" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "ASPARTIC ACID OR GLUTAMIC 

ACID" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

Val Xaa Xaa Leu Gly Leu Gly 

1 5 

(2) INFORMATION FOR SEQ ID NO:129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



( ix) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "THREONINE, SERINE OR 
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ALANINE" 



( ix) FEATURE : 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "GLUTAMIC ACID OR ASPARTIC 

ACID" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

Val Xaa Xaa Leu Gly Leu Gly Phe 

1 5 

(2) INFORMATION FOR SEQ ID NQ:130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "ISOLEUCINE OR LEUCINE" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note= "SERINE OR ALANINE" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /note= "SERINE OR ALANINE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 0: 

Xaa Phe Arg Tyr Cys Xaa Gly Xaa Cys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 559 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

ATGGCTGCAG GAAGACTTCG GATCCTGTGT CTGCTGCTCC TGTCCTTGCA CCCGAGCCTC 60 

GGCTGGGTCC TTGATCTTCA AQAGGCTTCT GTGGCAGATA AGCTCTCATT TGQGAAOATG 120 

GCAGAGACTA GAGGGACCTG GACGCCCCAT CAGGGTAAGA ATTCCTGGGG GCCTCCCGAC 180 

TCCCCAATTC CTTCTCTCAA AGCCCTCACT TTGCCTTACA ATCCTACTCT ACCTTGCACT 240 

AGGTAACAAC CATGTCCGTC TTCCAAGAGC CTTGGCTGGT TCATGCCGAC TGTGGAGCCT 300 
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GACCCTACCA GTGGCTGAGC TGGGCCTGGG CTATGCCTCG GAGGAGAAGG TCATCTTCCG 3 60 

ATACTGTGCT GGCAGCTGTC CCCAAGAGGC CCGTACCCAG CACAGTCTGG TACTGGCCCG 42 0 

GCTTCGAGGG CGGGGTCGAG CCCATGGCCG ACCCTGCTGC CAGCCCACCA GCTATGCTGA 4 80 

TGTGACCTTC CTTGATGATC AGCACCATTG GCAGCAGCTG CCTCAGCTCT CAGCTGCAGC 540 

TTGTGGCTGT GGTGGCTGA 55 9 
(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:132: 

Met Ala Ala Gly Arg Leu Arg lie Leu Cys Leu Leu Leu Leu Ser Leu 
15 10 15 

His Pro Ser Leu Gly Trp Val Leu Asp Leu Gin Glu Ala Ser Val Ala 

20 25 30 

Asp Lys Leu Ser Phe Gly Lys Met Ala Glu Thr Arg Gly Thr Trp Thr 
35 40 45 

Pro His Gin Gly Lys Asn Ser Trp Gly Pro Pro Asp Ser Pro lie Pro 
50 55 60 

Ser Leu Lys Ala Leu Thr Leu Pro Tyr Asn Pro Thr Leu Pro Cys Thr 
65 70 75 80 



Arg 



(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

Trp Leu Gin Glu Asp Phe Gly Ser Cys Val Cys Cys Ser Cys Pro Cys 
15 10 15 

Thr Arg Ala Ser Ala Gly Ser Leu lie Phe Lys Arg Leu Leu Trp Gin 
20 25 30 

lie Ser Ser His Leu Gly Arg Trp Gin Arg Leu Glu Gly Pro Gly Arg 
35 40 45 

Pro lie Arg Val Arg lie Pro Gly Gly Leu Pro Thr Pro Gin Phe Leu 
50 55 60 
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Leu Ser Lys 
65 



Pro Ser Leu Cys Leu Thr lie Leu Leu Tyr Leu Ala Leu 
70 75 80 



Gly Asn Asn 



His Val Arg Leu Pro Arg Ala Leu Ala Gly Ser Cys Arg 

85 90 95 



Leu Trp Ser 



Leu Thr Leu Pro Val Ala Glu Leu Gly Leu Gly Tyr Ala 
100 105 110 



Ser Glu Glu 
115 



Lys Val lie Phe Arg Tyr Cys Ala Gly Ser Cys Pro Gin 
120 125 



Glu Ala Arg 
13 0 



Thr Gin His Ser Leu Val Leu Ala Arg Leu Arg Gly Arg 
135 140 



Gly Arg Ala 
14 5 



His Gly Arg Pro Cys Cys Gin Pro Thr Ser Tyr Ala Asp 
150 155 160 



Val Thr Phe 



Leu Asp Asp Gin His His Trp Gin Gin Leu Pro Gin Leu 
165 170 175 



Ser Ala Ala 



Ala Cys Gly Cys Gly Gly 
180 185 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 559 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 134 : 

ATGGCTGCAG GAAGACTTCG GATCTTGTTT CTGCTGCTCC TGTCCTTGCA CCTGGGCCTT 6 0 

GGCTGGGTCC TTGATCTTCA AGAGGCTCCT GCGGCAGATG AGCTCTCATC TGGGAAAATG 12 0 

GCAGAGACTG GAAGGACCTG GAAGCCCCAT CAGGGTAAGA ATTCTTGGGG GCCTCCTAAC 180 

TCTACAGTTC TTCCTCTCAA AGCCCTCACT TTGCCTCACA ATCCTATTCT ACCTTGCACT 240 

AGGTAACAAC AATGTCCGCC TTCCAAGAGC CTTACCTGGT TTGTGCCGGC TGTGGAGCCT 300 

GACCCTACCA GTGGCTGAGC TTGGCCTGGG CTATGCCTCA GAGGAGAAGA TTATCTTCCG 360 

ATACTGTGCT GGCAGCTGTC CCCAAGAGGT CCGTACCCAG CACAGTCTGG TGCTGGCCCG 420 

TCTTCGAGGG CAGGGTCGAG CTCATGGCAG ACCTTGCTGC CAGCCCACCA GCTATGCTGA 480 

TGTGACCTTC CTTGATGACC ACCACCATTG GCAGCAGCTG CCTCAGCTCT CAGCCGCAGC 540 

TTGTGGCTGT GGTGGCTGA 559 
(2) INFORMATION FOR SEQ ID NO: 13 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

Met Ala Ala Gly Arg Leu Arg lie Leu Phe Leu Leu Leu Leu Ser Leu 
15 10 15 

His Leu Gly Leu Gly Trp Val Leu Asp Leu Gin Glu Ala Pro Ala Ala 

20 25 30 

Asp Glu Leu Ser Ser Gly Lys Met Ala Glu Thr Gly Arg Thr Trp Lys 
35 40 45 

Pro His Gin Gly Lys Asn Ser Trp Gly Pro Pro Asn Ser Thr Val Leu 

50 55 60 

Pro Leu Lys Ala Leu Thr Leu Pro His Asn Pro lie Leu Pro Cys Thr 
65 70 75 80 



Arg 



(2) INFORMATION FOR SEQ ID NO: 13 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

Trp Leu Gin Glu Asp Phe Gly Ser Cys Phe Cys Cys Ser Cys Pro Cys 
15 10 15 

Thr Trp Ala Leu Ala Gly Ser Leu lie Phe Lys Arg Leu Leu Arg Gin 
20 25 30 

Met Ser Ser His Leu Gly Lys Trp Gin Arg Leu Glu Gly Pro Gly Ser 
35 40 45 

Pro lie Arg Val Arg lie Leu Gly Gly Leu Leu Thr Leu Gin Phe Phe 
50 55 60 

Leu Ser Lys Pro Ser Leu Cys Leu Thr lie Leu Phe Tyr Leu Ala Leu 
65 70 75 80 

Gly Asn Asn Asn Val Arg Leu Pro Arg Ala Leu Pro Gly Leu Cys Arg 
85 90 95 

Leu Trp Ser Leu Thr Leu Pro Val Ala Glu Leu Gly Leu Gly Tyr Ala 

100 105 110 

Ser Glu Glu Lys lie lie Phe Arg Tyr Cys Ala Gly Ser Cys Pro Gin 
115 120 125 

Glu Val Arg Thr Gin His Ser Leu Val Leu Ala Arg Leu Arg Gly Gin 
130 135 140 

Gly Arg Ala His Gly Arg Pro Cys Cys Gin Pro Thr Ser Tyr Ala Asp 
145 150 155 160 

Val Thr Phe Leu Asp Asp His His His Trp Gin Gin Leu Pro Gin Leu 
165 170 175 

Ser Ala Ala Ala Cys Gly Cys Gly Gly 
180 185 
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(2) INFORMATION FOR SEQ ID NO: 13 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 
AATCCCCAGG ACAGGCAGGG AAT 23 
(2) INFORMATION FOR SEQ ID NO: 13 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 
CGGTACCCAG ATCTTCAGCC ACCACAGCCA CAAGC 35 
(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 9: 
GGACTATCAT ATGGCCCACC ACCACCACCA CCACCACCAC QACGACGACG ACAAGGCCTT 60 
GGCTGGTTCA TGCCGA 76 
(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 
CGGTACCCAG ATCTTCAGCC ACCACAGCCA CAAGC 35 
(2) INFORMATION FOR SEQ ID NO: 141: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

Ala Leu Ala Gly Ser Cys Arg Leu Trp Ser Leu Thr Leu Pro Val Ala 
15 10 15 

Glu Leu Gly Leu Gly Tyr Ala Ser Glu Glu Lys Val lie Phe Arg Tyr 
20 25 30 

Cys Ala Gly Ser Cys Pro Gin Glu Ala Arg Thr Gin His Ser Leu Val 
35 40 45 

Leu Ala Arg Leu Arg Gly Arg Gly Arg Ala His Gly Arg Pro Cys Cys 
50 55 60 

Arg Pro Thr Ala Tyr Glu Asp Glu Val Ser Phe Leu Asp Val His Ser 
65 70 75 80 

Arg Tyr His Thr Leu Gin Glu Leu Ser Ala Arg Glu Cys Ala Cys Val 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 142 : 
TAATACGACT CACTATAGGG GAA 23 
(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
TCGTCTTCGT AAGCAGTCGG ACGGCAGCAG GGTCGGCCAT GGGCTCGAC 49 
(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
TGCTGCCGTC CGACTGCTTA CGAAGACGA 2 9 

(2) INFORMATION FOR SEQ ID NO: 14 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
GTTATGCTAG TTATTGCTCA GCGGT 25 
(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECXJLE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

Leu Glu Val Arg Val Ser 
15 

Thr Val Leu Phe Arg Tyr 
30 

lie Tyr Asp Leu Gly Leu 
45 



Arg Glu Arg Ala Arg Ala 
60 

Asp Val Thr Phe Leu Asp 
75 80 

Leu Ser Ala Ala Ala Cys 
95 

Gly Cys Gly Gly 
100 

(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



Pro Gly Ala Arg Pro 

1 5 

Glu Leu Gly Leu Gly 
20 

Cys Ala Gly Ala Cys 
35 

Arg Arg Leu Arg Gin 
50 

His Pro Cys Cys Gin 
65 

Asp Gin His His Trp 
85 



Cys Gly Leu Arg Glu 
10 

Tyr Thr Ser Asp Glu 
25 



Glu Ala Ala lie Arg 
40 



Arg Arg Arg Val Arg 
55 

Pro Thr Ser Tyr Ala 
70 

Gin Gin Leu Pro Gin 
90 



BNSDOCI D: <WO_991 4235A 1J_> 



wo 99/14235 PCT/US98/19163 



175 

( D ) TOPOLOGY : 1 inear 
(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 
CACATCAGCA TAGCTGGTGG GCTGGCAGCA CGGGTGAGCA CGAGCACGTT 50 
(2) INFORMATION FOR SEQ ID NO: 14 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 
TGCTGCCAGC CCACCAGCTA TGCTG 25 
(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:149: 
CCTCGGAGGA GAAGGTCATC TTC 
(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

Cys Cys Val Arg Gin Leu Tyr lie Asp Phe Arg Lys Asp Leu Gly Trp 
15 10 15 

Lys Trp lie His Glu Pro Lys Gly Tyr His Ala Asn Phe Cys Leu Gly 
20 25 30 

Pro Cys Pro Tyr lie Trp Ser Leu Asp Thr Gin Tyr Ser Lys Val Leu 
35 40 45 

Ala Leu Tyr Asn Gin His Asn Pro Gly Ala Ser Ala Ala Pro Cys Cys 
50 55 60 
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Val Pro Gin Ala Leu Glu Pro Leu Pro lie Val Tyr Tyr Val Gly Arg 
65 70 75 80 

Lys Pro Lys Val Glu Gin Leu Ser Asn Met lie Val Arg Ser Cys Lys 
85 90 95 

Cys Ser 



(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

Cys Cys Leu Arg Pro Leu Tyr lie Asp Phe Lys Arg Asp Leu Gly Tarp 
15 10 15 

Lys Trp lie His Glu Pro Lys Gly Tyr Asn Ala Asn Phe Cys Ala Gly 
20 25 30 

Ala Cys Pro Tyr Leu Trp Ser Ser Asp Thr Gin His Ser Arg Val Leu 
35 40 45 

Ser Leu Tyr Asn Thr lie Asn Pro Glu Ala Ser Ala Ser Pro Cys Cys 
50 55 60 

Val Ser Gin Asp Leu Glu Pro Leu Thr lie Leu Tyr Tyr lie Gly Lys 

65 70 75 80 

Thr Pro Lys lie Glu Gin Leu Ser Asn Met lie Val Lys Ser Cys Lys 
85 90 95 



Cys Ser 



(2) INFORMATION FOR SEQ ID NO : 152 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

Cys Cys Val Arg Pro Leu Tyr lie Asp Phe Arg Gin Asp Leu Gly Trp 
15 10 15 

Lys Trp Val His Glu Pro Lys Gly Tyr Tyr Ala Asn Phe Cys Ser Gly 
20 25 30 

Pro Cys Pro Tyr Leu Arg Ser Ala Asp Thr Thr His Ser Thr Val Leu 
35 40 45 

Gly Leu Tyr Asn Thr Leu Asn Pro Glu Ala Ser Ala Ser Pro Cys Cys 
50 55 60 
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Val Pro Gin Asp Leu 
65 

Thr Pro Lys Val Glu 
85 

Cys Ser 



177 



Glu Pro Leu Thr lie Leu 
70 75 

Gin Leu Ser Asn Met Val 
90 



Tyr Tyr Val Gly Arg 
80 

Val Lys Ser Cys Lys 
95 



(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 
(E) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

Cys Cys Lys Lys Gin Phe Phe Val Ser Phe Lys Asp lie Gly Trp Asn 
15 10 15 

Asp Trp lie lie Ala Pro Ser Gly Tyr His Ala Asn Tyr Cys Glu Gly 
20 25 30 

Glu Cys Pro Ser His lie Ala Gly Thr Ser Gly Ser Ser Leu Ser Phe 
35 40 45 

His Ser Thr Val He Asn His Tyr Arg Met Arg Gly His Ser Pro Phe 
50 55 60 

Ala Asn Leu Lys Ser Cys Cys Val Pro Thr Lys Leu Arg Pro Met Ser 
65 70 75 80 

Met Leu Tyr Tyr Asp Asp Gly Gin Asn He He Lys Lys Asp He Gin 
85 90 95 

Asn Met He Val Glu Glu Cys Gly Cys Ser 
100 105 

(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

Cys Cys Arg Gin Gin Phe Phe He Asp Phe Arg Leu He Gly Trp Asn 
15 10 15 

Asp Trp He He Ala Pro Thr Gly Tyr Tyr Gly Asn Tyr Cys Glu Gly 
20 25 30 

Ser Cys Pro Ala Tyr Leu Ala Gly Val Pro Gly Ser Ala Ser Ser Phe 
35 40 45 

His Thr Ala Val Val Asn Gin Tyr Arg Met Arg Gly Leu Asn Pro Gly 
50 55 60 
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Thr Val Asn Ser Cys Cys lie Pro Thr Lys Leu Ser Thr Met Ser Met 
65 70 75 80 

Leu Tyr Phe Asp Asp Glu Tyr Asn lie Val Lys Arg Asp Val Pro Asn 
85 90 95 

Met lie Val Glu Glu Cys Gly Cys Ala 
100 105 

(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

Cys Arg Arg Val Lys Phe Gin Val Asp Phe Asn Leu lie Gly Trp Gly 
15 10 15 

Ser Trp lie lie Tyr Pro Lys Gin Tyr Asn Ala Tyr Arg Cys Glu Gly 
20 25 30 

Glu Cys Pro Asn Pro Val Gly Glu Glu Phe His Pro Thr Asn His Ala 
35 40 45 

Tyr lie Gin Ser Leu Leu Lys Arg Tyr Gin Pro His Arg Val Pro Ser 
50 55 60 

Thr Cys Cys Ala Pro Val Lys Thr Lys Pro Leu Ser Met Leu Tyr Val 
65 70 75 80 

Asp Asn Gly Arg Val Leu Leu Glu His His Lys Asp Met lie Val Glu 
85 90 95 

Glu Cys Gly Cys Leu 
100 

(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

Cys Lys Arg His Pro Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn 
15 10 15 

Asp Trp lie Val Ala Pro Pro Gly Tyr His Ala Phe Tyr Cys His Gly 
20 25 30 

Glu Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala 
35 40 45 

lie Val Gin Thr Leu Val Asn Ser Val Asn Ser Lys lie Pro Lys Ala 
50 55 60 
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Cys Cys Val Pro Thr Glu Leu Ser Ala lie Ser Met Leu Tyr Leu Asp 
65 70 75 80 

Glu Asn Glu Lys Val Val Leu Lys Asn Tyr Gin Asp Met Val Val Glu 
85 90 95 

Gly Cys Gly Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn 
15 10 15 

Asp Trp lie Val Ala Pro Pro Gly Tyr Gin Ala Phe Tyr Cys His Gly 
20 25 30 

Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala 
35 40 45 

lie Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser lie Pro Lys Ala 
50 55 60 

Cys Cys Val Pro Thr Glu Leu Ser Ala lie Ser Met Leu Tyr Leu Asp 
65 70 75 80 

Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Met Val Val Glu 
85 90 95 

Gly Cys Gly Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asp 
15 10 15 

Asp Trp lie Val Ala Pro Leu Gly Tyr Asp Ala Tyr Tyr Cys His Gly 
20 25 30 

Lys Cys Pro Phe Pro Leu Ala Asp His Phe Asn Ser Thr Asn His Ala 
35 40 45 

Val Val Gin Thr Leu Val Asn Asn Met Asn Pro Gly Lys Val Pro Lys 
50 55 60 
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Ala Cys Cys Val Pro Thr Gin Leu Asp Ser Val Ala Met Leu Tyr Leu 
65 70 75 80 

Asn Asp Gin Ser Thr Val Val Leu Lys Asn Tyr Gin Glu Met Thr Val 
85 90 95 

Val Gly Cys Gly Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gin 
15 10 15 

Asp Trp lie lie Ala Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp Gly 
20 25 30 

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
35 40 45 

lie Val Gin Thr Leu Val His Leu Met Phe Pro Asp His Val Pro Lys 
50 55 60 

Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala lie Ser Val Leu Tyr Phe 

65 70 75 80 

Asp Asp Ser Ser Asn Val lie Leu Lys Lys Tyr Arg Asn Met Val Val 
85 90 95 

Arg Ser Cys Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

Cys Arg Lys His Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Gin 
15 10 15 

Asp Trp lie lie Ala Pro Lys Gly Tyr Ala Ala Asn Tyr Cys Asp Gly 

20 25 30 

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
35 40 45 

lie Val Gin Thr Leu Val His Leu Met Asn Pro Glu Tyr Val Pro Lys 
50 55 60 
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Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala lie Ser Val Leu Tyr Phe 
65 70 75 80 

Asp Asp Asn Ser Asn Val lie Leu Lys Lys Tyr Arg Asn Met Val Val 
85 90 95 

Arg Ala Cys Qly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gin 
15 10 15 

Asp Trp lie lie Ala Pro Glu Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly 
20 25 30 

Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn Ala Thr Asn His Ala 

35 40 45 

lie Val Gin Thr Leu Val His Phe lie Asn Pro Glu Thr Val Pro Lys 
50 55 60 

Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala He Ser Val Leu Tyr Phe 
65 70 75 80 

Asp Asp Ser Ser Asn Val He Leu Lys Lys Tyr Arg Asn Met Val Val 
85 90 95 

Arg Ala Cys Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Leu 
15 10 15 

Asp Trp Val He Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly 
20 25 30 

Glu Cys Ser Phe Pro Leu Asp Ser Cys Met Asn Ala Thr Asn His Ala 
35 40 45 

He Leu Gin Ser Leu Val His Leu Met Lys Pro Asn Ala Val Pro Lys 
50 55 60 
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Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr Ser Val Leu Tyr Tyr 
65 70 75 80 

Asp Ser Ser Asn Asn Val lie Leu Arg Lys His Arg Asn Met Val Val 
85 90 95 

Lys Ala Cys Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 

Cys Gin Met Gin Thr Leu Tyr lie Asp Phe Lys Asp Leu Gly Trp His 
15 10 15 

Asp Trp lie lie Ala Pro Glu Gly Tyr Gly Ala Phe Tyr Cys Ser Gly 
20 25 30 

Glu Cys Asn Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
35 40 45 

lie Val Gin Thr Leu Val His Leu Leu Glu Pro Lys Lys Val Pro Lys 
50 55 60 

Pro Cys Cys Ala Pro Thr Arg Leu Gly Ala Leu Pro Val Leu Tyr His 
65 70 75 80 

Leu Asn Asp Glu Asn Val Asn Leu Lys Lys Tyr Arg Asn Met He Val 
85 90 95 

Lys Ser Cys Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

Cys Ala Arg Arg Tyr Leu Lys Val Asp Phe Ala Asp He Gly Trp Ser 
15 10 15 

Glu Trp He He Ser Pro Lys Ser Phe Asp Ala Tyr Tyr Cys Ser Gly 
20 25 30 

Ala Cys Gin Phe Pro Met Pro Lys Ser Leu Lys Pro Ser Asn His Ala 
35 40 45 

Thr He Gin Ser He Val Arg Ala Val Gly Val Val Pro Gly He Pro 
50 55 60 
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Glu Pro Cys Cys Val Pro Glu Lys Met Ser Ser Leu Ser lie Leu Phe 
65 70 75 80 

Phe Asp Glu Asn Lys Asn Val Val Leu Lys Val Tyr Pro Asn Met Thr 
85 90 95 

Val Glu Ser Cys Ala Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

Cys Lys Lys Arg His Leu Tyr Val Glu Phe Lys Asp Val Gly Trp Gin 
15 10 15 

Asn Trp Val lie Ala Pro Gin Gly Tyr Met Ala Asn Tyr Cys Tyr Gly 
20 25 30 

Glu Cys Pro Tyr Pro Leu Thr Glu lie Leu Asn Gly Ser Asn His Ala 
35 40 45 

lie Leu Gin Thr Leu Val His Ser He Glu Pro Glu Asp He Pro Leu 
50 55 60 

Pro Cys Cys Val Pro Thr Lys Met Ser Pro He Ser Met Leu Phe Tyr 
65 70 75 80 

Asp Asn Asn Asp Asn Val Val Leu Arg His Tyr Glu Asn Met Ala Val 
85 90 95 

Asp Glu Cys Gly Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

Cys Arg Ala Arg Arg Leu Tyr Val Ser Phe Arg Glu Val Gly Trp His 
15 10 15 

Arg Trp Val He Ala Pro Arg Gly Phe Leu Ala Asn Tyr Cys Gin Gly 
20 25 30 

Gin Cys Ala Leu Pro Val Ala Leu Ser Gly Ser Gly Gly Pro Pro Ala 
35 40 45 

Leu Asn His Ala Val Leu Arg Ala Leu Met His Ala Ala Ala Pro Gly 
50 55 60 
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Ala Ala Asp Leu Pro Cys Cys Val Pro Ala Arg Leu Ser Pro lie Ser 
65 70 75 80 

Val Leu Phe Phe Asp Asn Ser Asp Asn Val Val Leu Arg Gin Tyr Glu 
85 90 95 

Asp Met Val Val Asp Glu Cys Gly Cys Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRT^NDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:167: 

Cys His Arg His Gin Leu Phe lie Asn Phe Gin Asp Leu Gly Trp His 
15 10 15 

Lys Trp Val lie Ala Pro Lys Gly Phe Met Ala Asn Tyr Cys His Gly 
20 25 30 

Glu Cys Pro Phe Ser Met Thr Thr Tyr Leu Asn Ser Ser Asn Tyr Ala 
35 40 45 

Phe Met Gin Ala Leu Met His Met Ala Asp Pro Lys Val Pro Lys Ala 
50 55 60 

Val Cys Val Pro Thr Lys Leu Ser Pro lie Ser Met Leu Tyr Gin Asp 
65 70 75 80 

Ser Asp Lys Asn Val lie Leu Arg His Tyr Glu Asp Met Val Val Asp 
85 90 95 

Glu Cys Gly Cys Gly 
100 

(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:168: 

Cys Arg Arg Thr Ser Leu His Val Asn Phe Lys Glu lie Gly Trp Asp 
15 10 15 

Ser Trp lie lie Ala Pro Lys Asp Tyr Glu Ala Phe Glu Cys Lys Gly 
20 25 30 

Gly Cys Phe Phe Pro Leu Thr Asp Asn Val Thr Pro Thr Lys His Ala 
35 40 45 

lie Val Gin Thr Leu Val His Leu Gin Asn Pro Lys Lys Ala Ser Lys 
50 55 60 
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Ala Cys Cys Val Pro Thr Lys Leu Asp Ala lie Ser lie Leu Tyr Lys 
65 70 75 80 

Asp Asp Ala Gly Val Pro Thr Leu lie Tyr Asn Tyr Glu Gly Met Lys 
85 90 95 

Val Ala Glu Cys Gly Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 

Cys His Arg Val Ala Leu Asn He Ser Phe Gin Glu Leu Gly Trp Glu 
15 10 15 

Arg Trp He Val Tyr Pro Pro Ser Phe He Phe His Tyr Cys His Gly 
20 25 30 

Gly Cys Gly Leu His He Pro Pro Asn Leu Ser Leu Pro Val Pro Gly 

35 40 45 

Ala Pro Pro Thr Pro Ala Gin Pro Tyr Ser Leu Leu Pro Gly Ala Gin 
50 55 60 

Pro Cys Cys Ala Ala Leu Pro Gly Thr Met Arg Pro Leu His Val Arg 
65 70 75 80 

Thr Thr Ser Asp Gly Gly Tyr Ser Phe Lys Tyr Glu Thr Val Pro Asn 
85 90 95 

Leu Leu Thr Gin His Cys Ala Cys He 
100 105 

(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 0: 

Cys Ala Leu Arg Glu Leu Ser Val Asp Leu Arg Ala Glu Arg Ser Val 
15 10 15 

Leu He Pro Glu Thr Tyr Gin Ala Asn Asn Cys Gin Gly Ala Cys Gly 
20 25 30 

Trp Pro Gin Ser Asp Arg Asn Pro Arg Tyr Gly Asn His Val Val Leu 
35 40 45 

Leu Leu Lys Met Gin Ala Arg Gly Ala Thr Leu Ala Arg Pro Pro Cys 
50 55 60 
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Cys Val Pro Thr Ala Tyr Thr Gly Lys Leu Leu lie Ser Leu Ser Glu 
65 70 75 80 

Glu Arg lie Ser Ala His His Val Pro Asn Met Val Ala Thr Glu Cys 
85 90 95 

Gly Cys Arg 



(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
< D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

Cys Glu Leu His Asp Phe Ser Leu Ser Phe Ser Gin Leu Lys Trp Asp 
15 10 15 

Asn Trp lie Val Ala Pro His Ser Tyr Asn Pro Ser Tyr Cys Lys Gly 
20 25 30 

Asp Cys Pro Ser Ala Val Ser His Arg Tyr Gly Ser Pro Val His Thr 
35 40 45 

Met Val Gin Asn Met lie Tyr Glu Lys Leu Asp Pro Ser Val Pro Ser 
50 55 60 

Pro Ser Cys Val Pro Gly Lys Tyr Ser Pro Leu Ser Val Leu Thr lie 

65 70 75 80 

Glu Pro Asp Gly Ser lie Ala Tyr Lys Glu Tyr Glu Asp Met Met Ala 
85 90 95 

Thr Ser Cys Thr Cys Arg 
100 

. (2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

Cys Val Leu Thr Ala lie His Leu Asn Val Thr Asp Leu Gly Leu Gly 
15 10 15 

Tyr Glu Thr Lys Glu Glu Leu lie Phe Arg Tyr Cys Ser Gly Ser Cys 
20 25 30 

Asp Ala Ala Glu Thr Thr Tyr Asp Lys lie Leu Lys Asn Leu Ser Arg 
35 40 45 

Asn Arg Arg Leu Val Ser Asp Lys Val Gly Gin Ala Cys Cys Arg Pro 
50 55 60 
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He Ala Phe Asp Asp Asp Leu Ser Phe Leu Asp Asp Asn Leu Val Tyr 
65 70 75 80 

His He Leu Arg Lys His Ser Ala Lys Arg Cys Gly Cys He 
85 90 

(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

Cys Gly Leu Arg Glu Leu Glu Val Arg Val Ser Glu Leu Gly Leu Gly 
1 5 10 15 

Tyr Ala Ser Asp Glu Thr Val Leu Phe Arg Tyr Cys Ala Gly Ala Cys 
20 25 30 

Glu Ala Ala Ala Arg Val Tyr Asp Leu Gly Leu Arg Arg Leu Arg Gin 
35 40 45 

Arg Arg Arg Leu Arg Arg Glu Arg Val Arg Ala Gin Pro Cys Cys Arg 

50 55 60 

Pro Thr Ala Tyr Glu Asp Glu Val Ser Phe Leu Asp Ala His Ser Arg 
65 70 75 80 

Tyr His Thr Val His Glu Leu Ser Ala Arg Glu Cys Ala Cys Val 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 
GCCTTGGCTG GTTCATGCCG ACTGTGGAGC CTGACCCTAC CAGTGGCTGA GCTGGGCCTG 6 0 

GGCTATGCCT CGGAGGAGT^ GGTCATCTTC CGATACTGTG CTGGCAGCTG TCCCCAAGAG 12 0 

GCCCGTACCC AGCACAGTCT GGTACTGGCC CGGCTTCGAG GGCGGGGTCG AGCCCATGGC 180 

CGACCCTGCT GCCAGCCCAC CAGCTATGCT GATGTGACCT TCCTTGATGA TCAGCACCAT 240 

TGGCAGCAGC TGCCTCAGCT CTCAGCTGCA GCTTGTGGCT GTGGTGGCTG A 2 91 
(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

GTT^GAATTC CTGGGGGCCT CCCGACTCCC CAATTCCTTC TCTCAAAGCC CTCACTTTGC 60 

CTTACAATCC TACTCTACCT TGCACTAGGT AACAACCATG TCCGTCTTCC T^GAGCCTTG 12 0 

GCTGGTTCAT GCCGACTGTG GAGCCTGACC CTACCAGTGG CTGAGCTGGG CCTGGGCTAT 18 0 

GCCTCGGAGG AGAAGGTCAT CTTCCGATAC TGTGCTGGCA GCTGTCCCCA AGAGGCCCGT 240 

ACCCAGCACA GTCTGGTACT GGCCCGGCTT CGAGGGCGGG GTCGAGCCCA TGGCCGACCC 3 00 

TGCTGCCAGC CCACCAGCTA TGCTGATGTG ACCTTCCTTG ATGATCAGCA CCATTGGCAG 360 

CAGCTGCCTC AGCTCTCAGC TGCAGCTTGT GGCTGTGGTG GCTGA 405 



(2) INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

GCCTTACCTG GTTTGTGCCG GCTGTGGAGC CTGACCCTAC CAGTGGCTGA GCTTGGCCTG 60 

GGCTATGCCT CAGAGGAGAA GATTATCTTC CGATACTGTG CTGGCAGCTG TCCCCAAGAG 120 

GTCCQTACCC AGCACAGTCT GGTGCTGGCC CGTCTTCGAG GGCAGG6TCG AGCTCATGGC 180 

AGACCTTGCT GCCAGCCCAC CAGCTATGCT GATGTGACCT TCCTTGATGA CCACCACCAT 240 

TGGCAGCAGC TGCCTCAGCT CTCAGCCGCA GCTTGTGGCT GTGGTGGCTG A 291 



(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

ATQGCTGCAG GAAGACTTCG GATCCTGTGT CTGCTGCTCC TGTCCTTGCA CCCGAGCCTC 60 

GGCTGGGTCC TTGATCTTCA AGAGQCTTCT GTGGCAGATA AGCTCTCATT TGGGAAGATG 120 

GCAGAGACTA GAGGGACCTG GACGCCCCAT CAGGGTAAGA ATTCCTGGGG GCCTCCCGAC 180 

TCCCCAATTC CTTCTCTCAA AGCCCTCACT TTGCCTTACA ATCCTACTCT ACCTTGCACT 240 

AGGTAACAAC CATGTCCGTC TTCCAAGAGC CTTGGCTGGT TCATGCCGAC TGTGGAGCCT 300 
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GACCCTACCA GTGGCTGAGC TGGGCCTGGG CTATGCCTCG GAGGAGAAGG TCATCTTCCG 3 60 

ATACTGTGCT GGCAGCTGTC CCCAAGAGGC CCGTACCCAG CACAGTCTGG TACTGGCCCG 42 0 

GCTTCGAGGG CGGGGTCGAG CCCATGGCCG ACCCTGCTGC CAGCCCACCA GCTATGCTGA 480 

TGTGACCTTC CTTGATGATC AGCACCATTG GCAGCAGCTG CCTCAGCTCT CAGCTGCAGC 540 

TTGTGGCTGT GGTGGCTGAA GGAGGCCAGT CTGGTGTCTC AGAATCACAA GCATGAGACA 600 

GGCTGGGCTT TGAAAGGCTC AGGTGACATT ACTAGAAATT TGCATAGGTA AAGATAAGAA 660 

GGGAAAGGAC CAGGGGTTTT TTGTTTCTTT CTTTGCTTGC TTGTTAGTTT TTTTTTTTTT 72 0 

XTT 723 
(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 

TACCGACGTC CTTCTGAAGC CTAGGACACA GACGACGAGG ACAGGAACGT GGGCTCGGAG 60 

CCGACCCAGG AACTAGAAGT TCTCCGAAGA CACCGTCTAT TCGAGAGTAA ACCCTTCTAC 120 

CGTCTCTGAT CTCCCTGGAC CTGCGGGGTA GTCCCATTCT TAAGGACCCC CGGAGGGCTG 180 

AGGGGTTAAG GAAGAGAGTT TCGGGAGTGA AACGGAATGT TAGGATGAGA TGGAACGTGA 240 

TCCATTGTTG GTACAGGCAG AAGGTTCTCG GAACCGACCA AGTACGGCTG ACACCTCGGA 300 

CTGGGATGGT CACCGACTCG ACCCGGACCC GATACGGAGC CTCCTCTTCC AGTAGAAGGC 360 

TATGACACGA CCGTCGACAG GGGTTCTCCG GGCATGGGTC GTGTCAGACC ATGACCGGGC 420 

CGAAGCTCCC GCCCCAGCTC GGGTACCGGC TGGQACGACG GTCGGGTGGT CGATACGACT 480 

ACACTGGAAG GAACTACTAG TCGTGGTAAC CGTCGTCGAC GGAGTCGAGA GTCQACGTCG 540 

AACACCGACA CCACCGACTT CCTCCGGTCA GACCACAGAG TCTTAGTGTT CGTACTCTGT 600 

CCGACCCGAA ACTTTCCGAG TCCACTGTAA TGATCTTTAA ACGTATCCAT TTCTATTCTT 660 

CCCTTTCCTG GTCCCCAAAA AACAAAGAAA GAAACGAACG AACAATCAAA AAAAAAAAAA 72 0 

AAA 723 
(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 
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ATGGCTGCAG GAAGACTTCG GATCCTGTGT CTGCTGCTCC TGTCCTTGCA CCCGAGCCTC 60 

GGCTGGGTCC TTGATCTTCA AGAGGCTTCT GTGGCAGATA AGCTCTCATT TGGGAAGATG 120 

GCAGAGACTA GAGGQACCTG GACGCCCCAT CAGGGTAACA ACCATGTCCG TCTTCCAAGA 180 

GCCTTGGCTG GTTCATGCCG ACTGTGGAGC CTGACCCTAC CAGTGGCTGA GCTGGGCCTG 24 0 

GGCTATGCCT CGGAGGAGAA GGTCATCTTC CGATACTGTG CTGGCAGCTG TCCCCAAGAG 3 00 

GCCCGTACCC AGCACAGTCT GGTACTGGCC CGGCTTCGAG GGCGGGGTCG AGCCCATGGC 3 60 

CGACCCTGCT GCCAGCCCAC CAGCTATGCT GATGTGACCT TCCTTGATGA TCAGCACCAT 420 

TGGCAGCAGC TGCCTCAGCT CTCAGCTGCA GCTTGTGGCT GTGGTGGCTG A 471 
(2) INFORMATION FOR SEQ ID NO: 18 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECXJLE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

TACCGACGTC CTTCTGAAGC CTAGGACACA GACGACGAGG ACAGGAACGT GGGCTCGGAG 60 

CCGACCCAGG AACTAGAAGT TCTCCGAAGA CACCGTCTAT TCGAGAGTAA ACCCTTCTAC 120 

CGTCTCTGAT CTCCCTGGAC CTGCGGGGTA GTCCCATTGT TGGTACAGGC AGAAGGTTCT 180 

CGGAACCGAC CAAGTACGGC TGACACCTCG GACTGGGATG GTCACCGACT CGACCCGGAC 240 

CCGATACGGA GCCTCCTCTT CCAGTAGAAG GCTATGACAC GACCGTCGAC AGGGGTTCTC 3 00 

CGGGCATGGG TCGTGTCAGA CCATGACCGG GCCGAAGCTC CCGCCCCAGC TCGGGTACCG 360 

GCTGGGACGA CGGTCGGGTG GTCGATACGA CTACACTGGA AGQAACTACT AGTCGTGGTA 420 

ACCGTCGTCG ACGGAGTCGA GAGTCQACGT CGAACACCGA CACCACCGAC T 471 
(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

ATGGCTGCAG GAAGACTTCG GATCCTGTGT CTGCTGCTCC TGTCCTTGCA CCCGAGCCTC 60 

GGCTGGGTCC TTGATCTTCA AGAGGCTTCT GTGGCAGATA AGCTCTCATT TGQQAAGATG 120 

GCAGAGACTA GAGGGACCTG GACGCCCCAT CAGGGTAACA ACCATGTCCG TCTTCCAAGA 180 

(2) INFORMATION FOR SEQ ID NO: 182: 



BNSCX3CID: <WO_9914235A1_l_> 



wo 99/14235 



PCTAJS98/19163 



191 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

TACCGACGTC CTTCTGAAGC CTAGGACACA GACGACGAGG ACAGGAACGT GGGCTCGGAG 60 

CCGACCCAGG AACTAGAAGT TCTCCGAAGA CACCGTCTAT TCGAGAGTAA ACCCTTCTAC 120 

CGTCTCTGAT CTCCCTGGAC CTGCGGGGTA GTCCCATTGT TGGTACAGGC AGAAGGTTCT 180 



(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:183: 

GCCTTGGCTG GTTCATGCCG ACTGTGGAGC CTGACCCTAC CAGTGGCTGA GCTGGGCCTG 60 

GGCTATGCCT CGGAGGAGAA GGTCATCTTC CGATACTQTG CTGGCAGCTG TCCCCAAGAG 120 

GCCCGTACCC AGCACAGTCT GGTACTGGCC CGGCTTCGAG GGCGGGGTCG AGCCCATGGC 180 

CGACCCTGCT GCCAGCCCAC CAGCTATGCT GATGTGACCT TCCTTGATGA TCAGCACCAT 240 

TGGCAGCAGC TGCCTCAGCT CTCAGCTGCA GCTTGTGGCT GTGGTGGCTG A 291 



(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

CGGAACCGAC CAAGTACGGC TGACACCTCQ GACTGGGATG GTCACCGACT CGACCCGGAC 60 

CCGATACGGA GCCTCCTCTT CCAGTAGAAG GCTATGACAC GACCGTCGAC AGGGGTTCTC 12 0 

CGGGCATGGG TCGTGTCAGA CCATGACCGG GCCGAAGCTC CCGCCCCAGC TCGGGTACCG 180 

GCTGGGACGA CGGTCGGGTG GTCGATACGA CTACACTGGA AGGAACTACT AGTCGTGGTA 240 

ACCGTCGTCG ACGGAGTCGA GAGTCGACGT CGAACACCGA CACCACCGAC T 291 
(2) INFORMATION FOR SEQ ID NO: 185: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:185: 

Met Ala Ala Gly Arg Leu Arg lie Leu Cys Leu Leu Leu Leu Ser Leu 
15 10 15 

His Pro Ser Leu Gly Trp Val Leu Asp Leu Gin Glu Ala Ser Val Ala 
20 25 30 

Asp Lys Leu Ser Phe Gly Lys Met Ala Glu Thr Arg Gly Thr Trp Thr 
35 40 45 

Pro His Gin Gly Asn Asn His Val Arg Leu Pro Arg Ala Leu Ala Gly 
50 55 60 

Ser Cys Arg Leu Trp Ser Leu Thr Leu Pro Val Ala Glu Leu Gly Leu 
65 70 75 80 

Gly Tyr Ala Ser Glu Glu Lys Val lie Phe Arg Tyr Cys Ala Gly Ser 

85 90 95 

Cys Pro Gin Glu Ala Arg Thr Gin His Ser Leu Val Leu Ala Arg Leu 
100 105 110 

Arg Gly Arg Gly Arg Ala His Gly Arg Pro Cys Cys Gin Pro Thr Ser 
115 120 125 

Tyr Ala Asp Val Thr Phe Leu Asp Asp Gin His His Trp Gin Gin Leu 
130 135 140 

Pro Gin Leu Ser Ala Ala Ala Cys Gly Cys Gly Gly 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ II 

Met Ala Ala Gly Arg Leu Arg lie 
1 5 

His Pro Ser Leu Gly Trp Val Leu 
20 

Asp Lys Leu Ser Phe Gly Lys Met 
35 40 

Pro His Gin Gly Asn Asn His Val 
50 55 

(2) INFORMATION FOR SEQ ID NO: 187: 



I NO: 186: 

Leu Cys Leu Leu Leu Leu Ser Leu 
10 15 

Asp Leu Gin Glu Ala Ser Val Ala 
25 30 

Ala Glu Thr Arg Gly Thr Trp Thr 
45 

Arg Leu Pro Arg 
60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 

Ala Leu Ala Gly Ser Cys Arg Leu Trp Ser Leu Thr Leu Pro Val Ala 
15 10 15 

Glu Leu Gly Leu Gly Tyr Ala Ser Glu Glu Lys Val lie Phe Arg Tyr 
20 25 30 

Cys Ala Gly Ser Cys Pro Gin Glu Ala Arg Thr Gin His Ser Leu Val 
35 40 45 

Leu Ala Arg Leu Arg Gly Arg Gly Arg Ala His Gly Arg Pro Cys Cys 
50 55 60 

Gin Pro Thr Ser Tyr Ala Asp Val Thr Phe Leu Asp Asp Gin His His 
65 70 75 80 

Trp Gin Gin Leu Pro Gin Leu Ser Ala Ala Ala Cys Gly Cys Gly Gly 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 559 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

ATGGCTGCAG GAAGACTTCG GATCTTGTTT CTGCTGCTCC TGTCCTTGCA CCTGGGCCTT 60 

GGCTGGGTCC TTGATCTTCA AGAGGCTCCT GCGGCAGATG AGCTCTCATC TGGGAAAATG 120 

GCAGAGACTQ OAAGGACCTG GAAGCCCCAT CAGGGTAAGA ATTCTTGGGG GCCTCCTAAC 180 

TCTACAGTTC TTCCTCTCAA AGCCCTCACT TTGCCTCACA ATCCTATTCT ACCTTGCACT 240 

AGGTAACAAC AATGTCCGCC TTCCAAGAGC CTTACCTGGT TTGTGCCGGC TGTGQAGCCT 300 

GACCCTACCA GTGGCTGAGC TTGGCCTGGG CTATGCCTCA GAGGAGAAGA TTATCTTCCG 360 

ATACTGTGCT GGCAGCTGTC CCCAAGAGGT CCGTACCCAG CACAGTCTGG TGCTGGCCCG 420 

TCTTCGAGGG CAGGGTCGAG CTCATGGCAG ACCTTGCTGC CAGCCCACCA GCTATGCTGA 480 

TGTGACCTTC CTTGATGACC ACCACCATTG GCAGCAGCTG CCTCAGCTCT CAGCCGCAGC 540 

TTGTGGCTGT GGTGGCTQA 559 

(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 559 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 



TACCGACGTC 


CTTCTGAAGC 


CTAGAACAAA 


GACGACGAGG 


ACAGGAACGT 


GGACCCGGAA 


60 


CCGACCCAGG 


AACTAGAAGT 


TCTCCGAGGA 


CGCCGTCTAC 


TCGAGAGTAG 


ACCCTTTTAC 


120 


CGTCTCTGAC 


CTTCCTGGAC 


CTTCGGGGTA 


GTCCCATTCT 


T7VAGAACCCC 


CGGAGGATTG 


180 


AGATGTCAAG 


AAGGAGAGTT 


TCGGGAGTGA 


AACGGAGTGT 


TAGGATAAGA 


TGGAACGTGA 


240 


TCCATTGTTG 


TTACAGGCGG 


AAGGTTCTCG 


GAATGGACCA 


AACACGGCCG 


ACACCTCGGA 


300 


CTGGGATGGT 


CACCGACTCG 


AACCGGACCC 


GATACGGAGT 


CTCCTCTTCT 


AATAGAAGGC 


360 


TATGACACGA 


CCGTCGACAG 


GGGTTCTCCA 


GGCATGGGTC 


GTGTCAGACC 


ACGACCGGGC 


420 


AGAAGCTCCC 


GTCCCAGCTC 


GAGTACCGTC 


TGGAACGACG 


GTCGGGTGGT 


CGATACGACT 


480 


ACACTGGAAG 


GAACTACTGG 


TGGTGGTAAC 


CGTCGTCGAC 


GGAGTCGA6A 


GTCGGCGTCG 


540 


AACACCGACA 


CCACCGACT 










559 



(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 



ATGGCTGCAG 


GAAGACTTCG 


GATCTTGTTT 


CTGCTGCTCC 


TGTCCTTGCA 


CCTGGGCCTT 


60 


GGCTGGGTCC 


TTGATCTTCA 


AQAGGCTCCT 


GCGGCAGATG 


AGCTCTCATC 


TGGGAAAATG 


120 


GCAGAGACTG 


GAAGGACCTG 


GAAGCCCCAT 


CAGGGTAACA 


ACAATGTCCG 


CCTTCCAAGA 


180 


GCCTTACCTG 


GTTTGTGCCG 


GCTGTGGAGC 


CTGACCCTAC 


CAGTGGCTGA 


GCTTGGCCTG 


240 


GGCTATGCCT 


CAGAGGAGAA 


GATTATCTTC 


CGATACTGTG 


CTGGCAGCTG 


TCCCCAAGAG 


300 


GTCCGTACCC 


AGCACAGTCT 


GGTGCTGGCC 


CGTCTTCGAG 


GGCAGGGTCG 


AGCTCATGGC 


360 


AGACCTTGCT 


GCCAGCCCAC 


CAGCTATGCT 


GATGTGACCT 


TCCTTGATGA 


CCACCACCAT 


420 


TGGCAGCAGC 


TGCCTCAGCT 


CTCAGCCGCA 


GCTTGTGGCT 


GTGGTGGCTG 


A 


471 


(2) INFORMATION FOR SEQ ID NO: 191: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

TACCGACGTC CTTCTGAAGC CTAGAACAAA GACGACGAGG ACAGGAACGT GGACCCGGAA 60 

CCGACCCAGG AACTAGAAGT TCTCCGAGGA CGCCGTCTAC TCGAGAGTAG ACCCTTTTAC 120 

CGTCTCTGAC CTTCGTGGAC CTTCGGGGTA GTCCCATTGT TGTTACAGGC GGAAGGTTCT 180 

CGGAATGGAC CAAACACGGC CGACACCTCG GACTGGGATG GTCACCGACT CGAACCGGAC 24 0 

CCGATACGGA GTCTCCTCTT CTAATAGAAG GCTATGACAC GACCGTCGAC AGGGGTTCTC 30 0 

CAGGCATGGG TCGTGTCAGA CCACGACCGG GCAGAAGCTC CCGTCCCAGC TCGAGTACCG 360 

TCTGGAACGA CGGTCGGGTG GTCGATACGA CTACACTGGA AGGAACTACT GGTGGTGGTA 420 

ACCGTCGTCG ACGGAGTCGA GAGTCGGCGT CGAACACCGA CACCACCGAC T 471 
(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

ATGGCTGCAG GAAGACTTCG GATCTTGTTT CTGCTGCTCC TGTCCTTGCA CCTGGGCCTT 60 

GGCTGGGTCC TTGATCTTCA AGAGGCTCCT GCGGCAGATG AGCTCTCATC TGGQAAAATG 12 0 

GCAGAGACTG GAAGGACCTG GAAGCCCCAT CAGGGTAACA ACAATGTCCG CCTTCCAAGA 180 



(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:193: 

TACCGACGTC CTTCTGAAGC CTAGAACAAA GACGACGAGG ACAGGAACGT GGACCCGGAA 60 

CCGACCCAGG AACTAGAAGT TCTCCGAGGA CGCCGTCTAC TCGAGAGTAG ACCCTTTTAC 120 

CGTCTCTGAC CTTCCTGGAC CTTCGGGGTA GTCCCATTGT TGTTACAGGC GGAAGGTTCT 180 

(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 
GCCTTACCTG GTTTGTGCCG GCTGTGGAGC CTGACCCTAC CAGTGGCTGA GCTTGGCCTG 60 
GGCTATGCCT CAGAGGAGAA GATTATCTTC CGATACTGTG CTGGCAGCTG TCCCCAAGAG 120 
GTCCGTACCC AGCACAGTCT GGTGCTGGCC CGTCTTCGAG GGCAGGGTCG AGCTCATGGC 180 
AGACCTTGCT GCCAGCCCAC CAGCTATGCT GATGTGACCT TCCTTGATGA CCACCACCAT 240 
TGGCAGCAGC TGCCTCAGCT CTCAGCCGCA GCTTGTGGCT GTGGTGGCTG A 291 
(2) INFORMATION FOR SEQ ID NO:195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 
CGGAATGGAC CAAACACGGC CGACACCTCG GACTGGGATG GTCACCGACT CGAACCGGAC 60 
CCGATACGGA GTCTCCTCTT CTAATAGAAG GCTATGACAC GACCGTCGAC AGGGGTTCTC 120 
CAGGCATGGG TCGTGTCAGA CCACGACCGG GCAGAAGCTC CCGTCCCAGC TCGAGTACCG 180 
TCTGGAACGA CGGTCGGGTG GTCGATACGA CTACACTGGA AGGAACTACT GGTGGTGGTA 240 
ACCGTCGTCG ACGGAGTCGA GAGTCGGCGT CGAACACCGA CACCACCGAC T 2 91 

(2) INFORMATION FOR SEQ ID NO: 196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

Met Ala Ala Gly Arg Leu Arg lie Leu Phe Leu Leu Leu Leu Ser Leu 
15 10 15 

His Leu Gly Leu Gly Trp Val Leu Asp Leu Gin Glu Ala Pro Ala Ala 
20 25 30 

Asp Glu Leu Ser Ser Gly Lys Met Ala Glu Thr Gly Arg Thr Trp Lys 
35 40 45 

Pro His Gin Gly Asn Asn Asn Val Arg Leu Pro Arg Ala Leu Pro Gly 
50 55 60 
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Leu Cys Arg Leu Trp Ser Leu Thr Leu Pro Val Ala Glu Leu Gly Leu 
65 70 75 80 

Gly Tyr Ala Ser Glu Glu Lys lie lie Phe Arg Tyr Cys Ala Gly Ser 
85 90 95 

Cys Pro Gin Glu Val Arg Thr Gin His Ser Leu Val Leu Ala Arg Leu 
100 105 110 

Arg Gly Gin Gly Arg Ala His Gly Arg Pro Cys Cys Gin Pro Thr Ser 
115 120 125 

Tyr Ala Asp Val Thr Phe Leu Asp Asp His His His Trp Gin Gin Leu 
130 135 140 

Pro Gin Leu Ser Ala Ala Ala Cys Gly Cys Gly Gly 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 

Met Ala Ala Gly Arg Leu Arg lie Leu Phe Leu Leu Leu Leu Ser Leu 
15 10 15 

His Leu Gly Leu Gly Trp Val Leu Asp Leu Gin Glu Ala Pro Ala Ala 
20 25 30 

Asp Glu Leu Ser Ser Gly Lys Met Ala Glu Thr Gly Arg Thr Trp Lys 
35 40 45 

Pro His Gin Gly Asn Asn Asn Val Arg Leu Pro Arg 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STR7UOT)EDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

Ala Leu Pro Gly Leu Cys Arg Leu Trp Ser Leu Thr Leu Pro Val Ala 
15 10 15 

Glu Leu Gly Leu Gly Tyr Ala Ser Glu Glu Lys lie lie Phe Arg Tyr 
20 25 30 

Cys Ala Gly Ser Cys Pro Gin Glu Val Arg Thr Gin His Ser Leu Val 
35 40 45 

Leu Ala Arg Leu Arg Gly Gin Gly Arg Ala His Gly Arg Pro Cys Cys 
50 55 60 
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Gin Pro Thr Ser Tyr Ala Asp Val Thr Phe Leu Asp Asp His His His 
65 70 75 80 

Trp Gin Gin Leu Pro Gin Leu Ser Ala Ala Ala Cys Gly Cys Gly Gly 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECXJLE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 

GCCCTGTCTG GTCCATGCCA GCTGTGGAGC CTGACCCTGT CCGTGGCAGA GCTAGGCCTG 60 

GGCTACGCCT CAGAGGAGAA GGTCATCTTC CGCTACTGCG CCGGCAGCTG CCCCCGTGGT 120 

GCCCGCACCC AGCATGGCCT GGCGCTGGCC CGGCTGCAGG GCCAGGGCCG AGCCCACGGT 180 

GGGCCCTGCT GCCGGCCCAC TCGCTACACC GACGTGGCCT TCCTCGATGA CCGCCACCGC 240 

TGGCAGCGGC TGCCCCAGCT CTCGGCGGCT GCCTGCGGCT GTGGTGGCTG A 291 
(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:200: 

CGGGACAGAC CAGGTACGGT CGACACCTCG GACTGGGACA GGCACCGTCT CGATCCGGAC 60 

CCGATGCGGA GTCTCCTCTT CCAGTAGAAG GCGATGACGC GGCCGTCGAC GGGGQCACCA 120 

CGGGCGTGGG TCGTACCGGA CCGCQACCGG GCCGACGTCC CGGTCCCGGC TCGGGTGCCA 180 

CCCGGGACGA CGGCCGGGTG AGCGATGTGG CTGCACCGGA AGGAGCTACT GGCGGTGGCG 240 

ACCGTCGCCG ACGGGGTCGA OAGCCGCCGA CGGACGCCGA CACCACCGAC T 291 
(2) INFORMATION FOR SEQ ID NO:201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 



BNSDOCID: <WO 9914235A1 I > 



wo 99/14235 



PCT/US98/19163 



199 

GCCCTGTCTG GTCCATGCCA GCTGTGGAGC CTGACCCTGT CCGTGGCAGA GCTAGGCCTG 6 0 

GGCTACGCCT CAGAGGAGAA GGTCATCTTC CGCTACTGCG CCGGCAGCTG CCCCCGTGGT 12 0 

GCCCGCACCC AGCATGGCCT GGCGCTGGCC CGGCTGCAGG GCCAGGGCCG AGCCCACGGC 180 

GGGCCCTGCT GCCGGCCCAC TCGCTACACC GACGTGGCCT TCCTCGATGA CCGCCACCGC 240 

TGGCAGCGGC TGCCCCAGCT CTCGGCGGCT GCCTGCGGCT GTGGTGGCTG A 291 



(2) INFORMATION FOR SEQ ID NO: 2 02: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECUIiE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:202: 

CGGGACAGAC CAGGTACGGT CGACACCTCG GACTGGGACA GGCACCGTCT CGATCCGGAC 60 

CCGATGCGGA GTCTCCTCTT CCAGTAGAAG GCGATGACGC GGCCGTCGAC GGGGGCACCA 12 0 

CGGGCGTGGG TCGTACCGGA CCGCGACCGG GCCGACGTCC CGGTCCCGGC TCGGGTGCCG 180 

CCCGGGACGA CGGCCGGGTG AGCGATGTGG CTGCACCGGA AGGAGCTACT GGCGGTGGCG 240 

ACCGTCGCCG ACGGGGTCGA GAGCCGCCGA CGGACGCCGA CACCACCGAC T 291 



(2) INFORMATION FOR SEQ ID NO:203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 

ATGGCCGTAG GGAAGTTCCT GCTGGGCTCT CTGCTGCTCC TGTCCCTGCA GCTGGGACAG 60 

GGCTGGGGCC CCGATGCCCG TGGGGTTCCC GTGGCCGATG GAQAGTTCTC GTCTGAACAG 120 

GTGGCAAAGG CTGGAGGGAC CTGGCTGGGC ACCCACCGCC CCCTTGCCCG CCTGCGCCGA 180 

GCCCTGTCTG GTCCATGCCA GCTGTGGAGC CTGACCCTGT CCGTGGCAGA GCTAGGCCTG 240 

GGCTACGCCT CAGAGGAGAA GGTCATCTTC CGCTACTGCG CCGGCAGCTG CCCCCGTGGT 300 

GCCCGCACCC AGCATGGCCT GGCGCTGGCC CGGCTGCAGG GCCAGGGCCG AGCCCACGGT 360 

GGGCCCTGCT GCCGGCCCAC TCGCTACACC GACGTGGCCT TCCTCGATGA CCGCCACCGC 420 

TGGCAGCGGC TGCCCCAGCT CTCGGCGGCT GCCTGCGGCT GTGGTGGCTG A 471 
(2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:204: 
TACCGGCATC CCTTCAAGGA CGACCCGAGA GACGACGAGG ACAGGGACGT CGACCCTGTC 60 

CCGACCCCGG GGCTACGGGC ACCCCAAGGG CACCGGCTAC CTCTCAAGAG CAGACTTGTC 12 0 

CACCGTTTCC GACCTCCCTG GACCGACCCG TGGGTGGCGG GGGAACGGGC GGACGCGGCT 180 

CGGGACAGAC CAGGTACGGT CGACACCTCG GACTGGGACA GGCACCGTCT CGATCCGGAC 240 

CCGATGCGGA GTCTCCTCTT CCAGTAGAAG GCGATGACGC GGCCGTCGAC GGGGGCACCA 3 00 

CGGGCGTGGG TCGTACCGGA CCGCGACCGG GCCGACGTCC CGGTCCCGGC TCGGGTGCCA 3 60 

CCCGGGACGA CGGCCGGGTG AGCGATGTGG CTGCACCGGA AGGAGCTACT GGCGGTGGCG 420 

ACCGTCGCCG ACGGGGTCGA GAGCCGCCGA CGGACGCCGA CACCACCGAC T 471 
(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

ATGGCCGTAG GGAAGTTCCT GCTGGGCTCC CTGCTGCTCC TGTCCCTGCA GCTGGGACAG 60 

GGCTGGGGCC CCGATGCCCG TGGGGTTCCC GTGGCCGATG GAGAGTTCTC GTCTGAACAG 12 0 

GTGGCAAAGG CTGGAGGGAC CTGGCTGGGC ACCCACCGCC CCCTTGCCCG CCTGCGCCGA X80 

GCCCTGTCTG GTCCATGCCA GCTGTGGAGC CTGACCCTGT CCGTGGCA6A GCTAGGCCTG 240 

GGCTACGCCT CAGAGGAGAA GGTCATCTTC CGCTACTGCG CCGGCAGCTG CCCCCGTGGT 300 

GCCCGCACCC AGCATGGCCT GGCGCTGGCC CGGCTGCAGG GCCAQGGCCG AGCCCACGGC 3 60 

GGGCCCTGCT GCCGGCCCAC TCGCTACACC GACGTGGCCT TCCTCGATGA CCGCCACCGC 420 

TGGCAGCGGC TGCCCCAGCT CTCGGCGGCT GCCTGCGGCT GTGGTGGCTG A 471 
(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:206: 
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TACCGGCATC CCTTCAAGGA CGACCCGAGG GACGACGAGG ACAGGGACGT CGACCCTGTC 



60 



CCGACCCCGG GGCTACGGGC ACCCCZAAGGG CACCGGCTAC CTCTCAAGAG CAGACTTGTC 



120 



CACCGTTTCC GACCTCCCTG GACCGACCCG TGGGTGGCGG GGGAACGGGC GGACGCGGCT 



180 



CGGGACAGAC CAGGTACGGT CGACACCTCG GACTGGGACA GGCACCGTCT CGATCCGGAC 



240 



CCGATGCGGA GTCTCCTCTT CCAGTAGAAG GCGATGACGC GGCCGTCGAC GGGGGCACCA 



300 



CGGGCGTGGG TCGTACCGGA CCGCGACCGG GCCGACGTCC CGGTCCCGGC TCGGGTGCCG 



360 



CCCGGGACGA CGGCCGGGTG AGCGATGTGG CTGCACCGGA AGGAGCTACT GGCGGTGGCG 



420 



ACCGTCGCCG ACGGGGTCGA GAGCCGCCGA CGGACGCCGA CACCACCGAC T 



471 



(2) INFORMATION FOR SEQ ID NO:207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 07: 
ATGGCCGTAG GGAAGTTCCT GCTGGGCTCT CTGCTGCTCC TGTCCCTGCA GCTGGGACAG 6 0 

GGCTGGGGC 

(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 
TACCGGCATC CCTTCAAGGA CGACCCGAGA GACGACGAGG ACAGGGACGT CGACCCTGTC 60 
CCGACCCCG 69 
(2) INFORMATION FOR SEQ ID NO:209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 
ATGGCCGTAG GGAAGTTCCT GCTGGGCTCC CTGCTGCTCC TGTCCCTGCA GCTGGGACAG 60 
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GGCTGGGGC 69 
(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 
TACCGGCATC CCTTCAAGGA CGACCCGAGG GACGACGAGG ACAGGGACGT CGACCCTGTC 60 
CCGACCCCG 69 
(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 
CCCGATGCCC GTGGGGTTCC CGTGGCCGAT GGAGAGTTCT CGTCTGAACA GGTGGCAAAG 60 
GCTGGAGGGA CCTGGCTGGG CACCCACCGC CCCCTTGCCC GCCTGCGCCG A 111 
(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 
GGGCTACGGG CACCCCAAGG GCACCGGCTA CCTCTCAAGA GCAGACTTGT CCACCGTTTC 60 
CGACCTCCCT GGACCGACCC GTGGGTGGCG GGGGAAC6GG CGGACGCGGC T 111 
(2) INFORMATION FOR SEQ ID NO:213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:213: 
ATGGCCGTAG GGAAGTTCCT GCTGGGCTCT CTGCTGCTCC TGTCCCTGCA GCTGGGACAG 60 
GGCTGGGGCC CCGATGCCCG TGGGGTTCCC GTGGCCGATG GAGAGTTCTC GTCTGAACAG 120 
GTGGCAAAGG CTGGAGGGAC CTGGCTGGGC ACCCACCGCC CCCTTGCCCG CCTGCGCCGA 180 

(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 
TACCGGCATC CCTTCAAGGA CGACCCGAGA GACGACGAGG ACAGGGACGT CGACCCTGTC 60 
CCGACCCCGG GGCTACGGGC ACCCCAAGGG CACCGGCTAC CTCTCAAGAG CAGACTTGTC 120 
CACCGTTTCC GACCTCCCTG GACCGACCCG TGGGTGGCGG GGGAACGGGC GGACGCGGCT 180 

(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 
ATGGCCGTAG GGAAGTTCCT GCTGGGCTCC CTGCTGCTCC TGTCCCTGCA GCTGGGACAG 60 
GGCTGGGGCC CCGATGCCCG TGGGGTTCCC GTGGCCGATG GAGAGTTCTC GTCTGAACAG 120 
GTGGCAAAGG CTGGAGGGAC CTGGCTGGGC ACCCACCGCC CCCTTGCCCG CCTGCGCCGA 180 

(2) INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:216: 
TACCGGCATC CCTTCAAGGA CGACCCGAGG GACGACGAGG ACAGGGACGT CGACCCTGTC 60 
CCGACCCCGG GGCTACGGGC ACCCCAAGGG CACCGGCTAC CTCTCAAGAG CAGACTTGTC 120 
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CACCGTTTCC GACCTCCCTG GACCGACCCG TGGGTGGCGG GGGAACGGGC GGACGCGGCT 180 

(2) INFORMATION FOR SEQ ID NO:217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:217: 

Met Ala Val Gly Lys Phe Leu Leu Gly Ser Leu Leu Leu Leu Ser Leu 
15 10 15 

Gin Leu Gly Gin Gly Trp Gly Pro Asp Ala Arg Gly Val Pro Val Ala 

20 25 30 

Asp Gly Glu Phe Ser Ser Glu Gin Val Ala Lys Ala Gly Gly Thr Trp 
35 40 45 

Leu Gly Thr His Arg Pro Leu Ala Arg Leu Arg Arg Ala Leu Ser Gly 
50 55 60 

Pro Cys Gin Leu Trp Ser Leu Thr Leu Ser Val Ala Glu Leu Gly Leu 
65 70 75 80 

Gly Tyr Ala Ser Glu Glu Lys Val lie Phe Arg Tyr Cys Ala Gly Ser 
85 90 95 

Cys Pro Arg Gly Ala Arg Thr Gin His Gly Leu Ala Leu Ala Arg Leu 
100 105 110 

Gin Gly Gin Gly Arg Ala His Gly Gly Pro Cys Cys Arg Pro Thr Arg 
115 120 125 

Tyr Thr Asp Val Ala Phe Leu Asp Asp Arg His Arg Trp Gin Arg Leu 
130 135 140 

Pro Gin Leu Ser Ala Ala Ala Cys Gly Cys Gly Gly 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:218: 

Met Ala Val Gly Lys Phe Leu Leu Gly Ser Leu Leu Leu Leu Ser Leu 
15 10 15 

Gin Leu Gly Gin Gly Trp Gly Pro Asp Ala Arg Gly Val Pro Val Ala 
20 25 30 

Asp Gly Glu Phe Ser Ser Glu Gin Val Ala Lys Ala Gly Gly Thr Trp 
35 40 45 
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Leu Gly Thr His Arg Pro Leu Ala Arg Leu Arg Arg 
50 55 60 

.(2) INFORMATION FOR SEQ ID NO:219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:219: 

Met Ala Val Gly Lys Phe Leu Leu Gly Ser Leu Leu Leu Leu Ser Leu 
15 10 15 

Gin Leu Gly Gin Gly Trp Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 22 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:220: 

Pro Asp Ala Arg Gly Val Pro Val Ala Asp Gly Glu Phe Ser Ser Glu 
15 10 15 

Gin Val Ala Lys Ala Gly Gly Thr Trp Leu Gly Thr His Arg Pro Leu 
20 25 30 

Ala Arg Leu Arg Arg 
35 

(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

Ala Leu Ser Gly Pro Cys Gin Leu Trp Ser Leu Thr Leu Ser Val Ala 
15 10 15 

Glu Leu Gly Leu Gly Tyr Ala Ser Glu Glu Lys Val lie Phe Arg Tyr 
20 25 30 

Cys Ala Gly Ser Cys Pro Arg Gly Ala Arg Thr Gin His Gly Leu Ala 
35 40 45 
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Leu Ala Arg Leu Gin Gly Gin Gly Arg Ala His Gly Gly Pro Cys Cys 

50 55 60 

Arg Pro Thr Arg Tyr Thr Asp Val Ala Phe Leu Asp Asp Arg His Arg 
65 70 75 80 

Trp Gin Arg Leu Pro Gin Leu Ser Ala Ala Ala Cys Gly Cys Gly Gly 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:222: 

Cys Gin Leu Trp Ser Leu Thr Leu Ser Val Ala Glu Leu Gly Leu Gly 

15 10 15 

Tyr Ala Ser Glu Glu Lys Val lie Phe Arg Tyr Cys Ala Gly Ser Cys 
20 25 30 

Pro Arg Gly Ala Arg Thr Gin His Gly Leu Ala Leu Ala Arg Leu Gin 
35 40 45 

Gly Gin Gly Arg Ala His Gly Gly Pro Cys Cys Arg Pro Thr Arg Tyr 
50 55 60 

Thr Asp Val Ala Phe Leu Asp Asp Arg His Arg Trp Gin Arg Leu Pro 
65 70 75 80 

Gin Leu Ser Ala Ala Ala Cys Gly Cys Gly Gly 
85 90 

(2) INFORMATION FOR SEQ ID NO: 22 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 

Cys Gin Leu Tarp Ser Leu Thr Leu Ser Val Ala Glu Leu Gly Leu Gly 
15 10 15 

Tyr Ala Ser Glu Glu Lys Val lie Phe Arg Tyr Cys Ala Gly Ser Cys 

20 25 30 

Pro Arg Gly Ala Arg Thr Gin His Gly Leu Ala Leu Ala Arg Leu Gin 
35 40 45 

Gly Gin Gly Arg Ala His Gly Gly Pro Cys Cys Arg Pro Thr Arg Tyr 
50 55 60 
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Thr Asp Val Ala Phe Leu Asp Asp Arg His Arg Trp Gin Arg Leu Pro 
65 70 75 80 

Gin Leu Ser Ala Ala Ala Cys Gly Cys 
85 

(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

Ala Leu Ser Gly Pro 
1 5 

(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 
GTSASYGASY TGGGYCTGGG CTAY 
(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:226: 
TTYMGSTACT GCRSMGGCKC YTGC 
(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:227: 
RWAGGCSRTS GGKCKGCARC AKGS 
(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) liENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 
MKCRTCYARR AASGACASST C 21 
(2) INFORMATION FOR SEQ ID NO:229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 
CGGCTTGTGA CCGAGCTGGG CCTGGGCTAC GCCTCAGAGG AGAAGGTCAT CTTCCGCTAC 60 
TGCGCCGGCA GCTGCCCCCG TGGTGCCCGC ACCCAGCATG GCCTGGCGCT GGCCCGGCTG 120 
CAGGGCCAGG GCCGAGCCCA CGGCGGGCCC TGCTGCCGCC CCATGGCC 168 
(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 
GAGGAGAAGG TCATCTTCCG 20 
(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 
CCCGTGGGCT CGGCCCTGGC 20 
(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 
AGAGGAGAAG GTCATCTTCC GCTA 24 
(2) INFORMATION FOR SEQ ID NO:233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 
CTCGGCCCTG GCCCTGCAGC 20 
<2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRT^EDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 



TGCAGCCGGG CCAGCGCCAG 
(2) INFORMATION FOR SEQ ID NO: 23 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 
CGCGGATCCA TGCCTGGATT CGAGGGTGCA G 31 
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(2) INFORMATION FOR SEQ ID NO: 23 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 
CGCGGATCCA TGGCCGTAGG GAAGTTCCTG C 31 
(2) INFORMATION FOR SEQ ID NO: 23 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:237: 
CTCCCAAGCT TTTACTTGTC ATCGTCGTCC TTGTAGTCGC CACCACAGCC GCAGGCAGCC 60 



(2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:238: 
CTCCCAAGCT TTTACTTGTC ATCGTCGTCC TTGTAGTCTC GAGGAAGGCC ACGTCGGTG 59 
(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECXJLE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 
TCAGCCACCA CAGCCGCAGG CAGCC 25 
(2) INFORMATION FOR SEQ ID NO: 240: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:240: 

CATAAT^TAG GTGTGGAGTC GCAAAAAGTT TAAAGAAGAG AAAGGAACCA GAAAAAAAAA 60 

TAGAAAGCGC 70 

(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 
CATAAAATAG GTGTGGAGTC GCGAAAAGTT TA7VAGAGAGT AAGGAACCAG AAAAAAAAAT 60 
AGAAAGCGC €9 
(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 
CATAAAATAG GTGTGGAGTC GCGAAGTTTA AAGAGAGTAA GGAACCAGAA AAAAAAAATA 60 
GAAAGCGC 68 
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What: is Claimed is : 

1. An isola-ted and purified growth factor 
comprising persephin or fragment thereof or 
conservatively substituted variant thereof. 

2. The isolated and purified growth factor of 
claim 1 comprising a polypeptide sequence which has at 
least about 75% sequence identity with SEQ ID NO: 79, SEQ 
ID NO: 82 or SEQ ID NO: 223 or conservatively substituted 

5 variants thereof. 

3. The isolated and purified growth factor of 
claim 2 comprising a polypeptide sequence as set forth in 
SEQ ID NO: 187, SEQ ID NO: 198, SEQ ID NO: 221 or 
conservatively substituted variants thereof. 

4. The isolated and purified growth factor of 
claim 3 which promotes survival in mesencephalic cells. 

5. An isolated and purified polypeptide 
comprising: 

(a) a pre-pro persephin as set forth in SEQ ID 
NO:217, SEQ ID NO: 185, or SEQ ID NO: 196; 
5 (b) a prepro- region of persephin as set forth in 

SEQ ID N0:218, SEQ ID NO: 186, or SEQ ID NO: 197; 

(c) a pre- region of persephin as set forth in SEQ 
ID NO: 219; 

(d) a pro- region of persephin as set forth in SEQ 
10 ID NO: 220; or 

(e) conservatively substituted variants thereof. 

6. The isolated and purified growth factor of 
claim 1 comprising a polypeptide containing a sequence of 
amino acids having at least about 65% sequence identity 
with SEQ ID NO: 79 or SEQ ID NO: 82, or SEQ ID NO: 223 

5 wherein the growth factor is from a non-mammalian 
species . 

7. A method for obtaining a neurturin-persephin- 
GDNF family member growth factor comprising: 

(a) isolating (1) from a human genomic or cDNA 
library, a clone that hybridizes with a polynucleotide 
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5 comprising a persephin, neurturin or GDNF sequence or 
fragmen-ts thereof or (2) from a human genomic or cDNA 
template, a clone using a polymerase chain reaction 
method with degenerate primers of a conserved region of 
any two of persephin, neurturin or GDNF or fragments of 
10 said conserved regions; and 

( b ) sequencing said clone . 

8. An isolated and purified growth factor that is 
a neurturin-persephin-GDNF family member comprising a 
polypeptide having between about 30% and about 75% 
sequence identity with persephin, between about 30% and 
5 about 75% sequence identity with neurturin and between 
about 30% and about 75% sequence identity with GDNF 
wherein said factor is comprised of a conserved region 
sequence of amino acids having at least a 62.5 percent 
sequence identity with SEQ ID NO: 108 or at least a 62.5 
10 percent sequence identity with SEQ ID NO: 109 or at least 
a 50 percent sequence identity with SEQ ID NO: 110. 

13. A pan-growth factor comprising a fragment of 
the persephin polypeptide according to claim 1 and a 
fragment of at least one growth factor from the TGF-B 
superfamily other than persephin. 

14. An isolated and purified nucleic acid molecule 
or nucleic acid molecule complementary thereto comprising 
a nucleotide sequence encoding a growth factor of claim 1 
or a fragment of said nucleotide sequence consisting of 

5 at least 15 contiguous nucleotides. 

15. The isolated and purified nucleic acid 
molecule or nucleic acid molecule complementary thereto 
of claim 14 comprising a nucleotide sequence encoding a 
persephin polypeptide that promotes survival in 

5 mesencephalic cells wherein said nucleic acid molecule or 
complement thereto specifically hybridizes to SEQ ID 
N0:183, SEQ ID N0:184, SEQ ID N0:194, SEQ ID NO:195, SEQ 
ID NO: 199, SEQ ID NO: 200, SEQ ID NO: 201, or SEQ ID 
NO: 202. 
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16. The isolated and purified nucleic acid 
molecule or nucleic acid molecule complementary thereto 
of claim 15 comprising SEQ ID NO: 183, SEQ ID NO: 194, SEQ 
ID NO: 199 or SEQ ID NO: 201. 

17. A vector comprising expression regulatory 
elements operably linked to a nucleic acid molecule of 
claim 14. 

18. A host cell transformed with the vector of 
claim 17. 

19. An isolated and purified nucleic acid molecule 
comprising : 

( a ) a pre-pro persephin nucleotide sequence as set 
forth in SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO: 190, SEQ 
5 ID NO:191, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, 
or SEQ ID NO: 206 or a polynucleotide that specifically 
hybridizes to SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID 
NO:190, SEQ ID NO: 191, SEQ ID NO:203, SEQ ID N0:204, SEQ 
ID NO: 205, or SEQ ID NO : 206; 
10 (b) a pre-pro region of a persephin polynucleotide 

as set forth in SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID 
NO:192, SEQ ID NO: 193, SEQ ID NO:213, SEQ ID NO:214, SEQ 
ID NO:215, or SEQ ID NO:216; 

(c) a pre- region of a persephin polynucleotide as 
15 set forth in SEQ ID NOS:207, SEQ ID NO: 208, SEQ ID 

NO: 209, or SEQ ID NO: 210; 

(d) a pro- region of a persephin polynucleotide as 
set forth in SEQ ID NO: 211, or SEQ ID NO: 212; or 

(e) fragment thereof comprising at least 15 
20 contiguous nucleotides. 

20. A recombinant method comprising: 

(a) subcloning a polynucleotide encoding the 
growth factor of claim 1 into an expression vector 
comprising regulatory elements operably linked to the 

5 polynucleotide; 

(b) transforming a host cell with the expression 

vector ; 
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(c) growing the host cell in a host cell culture; 

and 

(d) harvesting the growth factor and/or the 
polynucleotide from the host cell culture. 

21. Isolated and purified antibodies which are 
capable of reacting with a growth factor as defined in 
claim 1 or an epitope thereof. 

22. A method for detecting the presence of a 
growth factor in a sample from a patient comprising 
reacting antibodies according to claim 21 with a growth 
factor present in the sample and detecting a binding of 

5 the antibodies with the growth factor. 

23. A kit for detecting the presence of a growth 
factor in a sample from a patient comprising antibodies 
of claim 21 which are capable of detectably reacting with 
said growth factor, packaged in a container. 

24. A method for preventing or treating cellular 
degeneration or insufficiency in an individual comprising 
administering to the individual a therapeutically 
effective amount of the growth factor of claim 1 or a 

5 polynucleotide encoding the growth factor of claim 1 . 

25. The method of claim 24 wherein the cellular 
degeneration or insufficiency is (a) neuronal 
degeneration resulting from peripheral neuropathy, 
amyotrophic lateral sclerosis, Alzheimer's disease, 

5 Parkinson's disease, Huntington's disease. Ischemic 
stroke, acute brain injury, acute spinal cord injury, 
nervous system tumors, multiple sclerosis, or infection; 
(b) hematopoietic cell degeneration or insufficiency 
resulting from eosinopenia, basopenia, lymphopenia, 
10 monocytopenia, neutropenia, anemias, thrombocytopenia, or 
stem-cell insufficiencies therefor; or (c) cardiac muscle 
degeneration or insufficiency resulting from 
cardiomyopathy or congestive heart failure. 

26. A method for preventing or treating cellular 
degeneration or insufficiency in an individual comprising 
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implanting into the individual, cells that express the 
growth factor of claim 1. 

27. A method for detecting the presence of a 
growth factor in a sample from a patient comprising 
detecting and/or quantitating the presence in the sample 
of mRNA encoding a growth factor of claim 1. 

28. A method for detecting persephin gene 
alterations comprising detecting the presence of a non- 
intact persephin gene in a cell wherein presence of the 
non- intact gene indicates the presence of gene 

5 alterations • 

29. A method for promoting the growth and/or 
differentiation of a cell in a culture medium comprising 
adding to the culture medium the growth factor of claim 
1. 

30. An isolated and purified persephin antisense 
polynucleotide comprising a sequence complementary to a 
nucleic acid sequence of claim 14 and capable of 
hybridizing to a naturally-occurring DNA or mRNA 

5 polynucleotide sequence encoding persephin to prevent 

transcription and/or translation of an encoded persephin 
polypeptide . 

31- A method for treating a disease condition 
mediated by expression of persephin by a population of 
cells comprising administering to said cells an 
inhibitory effective amount of the antisense 

5 polynucleotide of claim 30. 
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